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XENOBIOTIC DETOXIFICATION GENE FROM PLANTS 

This application claims priority to U.S. 
60/101,814, filed September 25, 1998, the entirety of 
which is incorporated by reference herein. 

5 Pursuant to 35 U.S.C. §202 (c), it is 

acknowledged that the U.S. Government has certain rights 
in the invention described herein, which was made in part 
with funds from the National Science Foundation, Grant 
No. IBN-9416016. 

10 

FIELD OF THE INVENTION 

This invention relates to the field of stress 
resistance in plants. In particular, the invention 
provides a novel gene from plants, which encodes an MDR- 
15 like ABC transporter, involved in detoxification of 
certain xenobiotics to protect plants from their 
detrimental effects. 

BACKGROUND OF THE INVENTION 

20 Several publications are referenced in this 

application to describe the state of the art to which the 
invention pertains. Each of these publications is 
incorporated by reference herein. 

Environmental stress is one of the most 

25 important limitations on plant productivity, growth and 
survival. An ever-increasing source of environmental 
stress to plants is the stress caused by environmental 
pollutants in the soil, water and atmosphere. Such 
pollutants include herbicides, pesticides and related 

30 agronomic products, as well as organic and inorganic 

waste material from industry and other sources. Other 
toxic agents that threaten the survival of plants include 
various toxins produced by ephiphytic or soilborne 
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microorganisms/ such as fungi and bacteria. 

In order to survive in toxic environments, 
plants must have mechanisms to detoxify xenobiotics, 
heavy metals and other toxic compounds. This generally 
involves modification of the toxic compound and 
subsequent excretion into the vacuole or apoplastic 
space. Recently, certain ATP-binding cassette (ABC) 
transporters have been identified in plants, which appear 
to be involved in the detoxification process. 

The ABC transporter family is very large, with 
representatives existing in many different classes of 
organisms. Two well studied groups of ABC transporters, 
encoded by mdr and mrp genes, respectively, are 
associated with the multi-drug resistance phenomenon 
observed in mammalian tumor cells. The mdr genes encode 
a family of P-glycoproteins that mediate the energy- 
dependent efflux of certain lipophilic drugs from cells. 
The mrp genes encode a family of transporters that 
mediate the extrusion of a variety of organic compounds 
after their conjugation with glutathione. YCF1, the 
yeast homolog of mrp, encodes a protein capable of . 
glutathione-mediated detoxification of heavy metals. 

Horaologs of mrp and mdr genes have been 
identified in plant species. In Arabidopsis thaliana, 
the glutathione-con jugate transporter encoded by the mrp 
homolog is located in the vacuolar membrane and is 
responsible for sequestration of xenobiotics in the 
central vacuole (Tommasini et al., FEBS Lett. 411 ; 206- 
210, 1997; Li et al., Plant Physiol. 107: 1257-1268, 
1995) . An mdr-like gene {atpgpl) has also been 
identified in A. thaliana, which encodes a putative P- 
glycoprotein homolog. The atpgpl gene was found to share 
significant sequence homology and structural organization 
with human mdr genes, and was expressed with particular 
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abundance in inflorescence axes (Dudler & Hertig, J. 
Biol. Chem. 267 : 5882-5888, 1992). Other MDR homologs 
have been found in potato {Wang et al., Plant Mol. Biol. 
31: 683, 1996) and barley (Davies et al., Gene 199: 195, 
1997) . 

The aforementioned mrp and mdr plant homologs 
were identified as a result of an effort to understand 
the molecular basis for development in plants of cross- 
resistance to herbicides of unrelated classes. However, 
these transporters are likely to serve the more general 
role in plants of sequestering, secreting, or otherwise 
detoxifying various organic and inorganic xenobiotics. 
Accordingly, it will constitute an advance in the art of 
plant genetic engineering of stress tolerance to identify 
and characterize other members of this class of 
transporters in plants. 

SUMMARY OF THE INVENTION 

In accordance ,with the present invention, a new 
plant mdr homolog has been identified. Unlike the 
previously identified plant mdr homologs, this new gene 
is inducible by a class of compounds known to inhibit 
chloride ion channels. 

According to one aspect of the invention, a 
nucleic acid isolated from a plant is provided, which 
encodes a p-glycoprotein that is inducible by exposure of 
the plant to NPPB. The isolated nucleic acid is 
preferentially expressed in plant roots upon exposure of 
the plant to NPPB. In a preferred embodiment, the plant 
from which the nucleic acid is isolated is selected from 
the group consisting of Brassica napus and Arabidopsis 
thaliana and is 3850-4150 nucleotides in length. In a 
more preferred embodiment, the nucleic acid has the 
restriction sites shown in Figure 4 for at least three 
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restriction enzymes. In particularly preferred 
embodiments, the nucleic acid molecule encodes a 
polypeptide having SEQ ID NO: 2. In an exemplary 
embodiment, the nucleic acid is a cDNA comprising the 
5 coding region of SEQ ID NO:l or SEQ ID NO: 10. 

According to another aspect of the invention is 
an expression cassette that comprises a plPAC gene 
operably linked to a promoter, and in a more preferred 
embodiment the plPAC gene is from Arabidopsis. In 

10 preferred embodiments, the expression cassette comprises 
the cauliflower mosaic virus 35S promoter, and part of 
all of SEQ ID NO:l or SEQ ID NO: 10. Further included in 
this aspect is a vector comprising the expression 
cassette and a method for producing transgenic plants 

15 with the expression cassette and vector. 

Another aspect of the invention are transgenic 
cells and plants containing the nucleic acids of the 
invention. In one preferred embodiment, the nucleic 
acids are be in the aforementioned expression cassette. 

20 Furhter included in this aspect are reporductive units 
from the transgenic plant. 

According to another aspect of the invention, 
an isolated nucleic acid molecule is provided, which has 
a sequence selected from the group consisting of: a) SEQ 

25 ID NO:l and SEQ ID NO: 10; b) a nucleic acid sequence 
that is at least about 60% homologous to the coding 
regions of SEQ ID NO:l or SEQ ID NO: 10; c) a sequence 
hybridizing with SEQ ID NO:l or SEQ ID NO: 10 at moderate 
stringency; d) a sequence encoding part or all of a 

30 polypeptide having SEQ ID NO: 2; e) a sequence encoding an 
amino acid sequence that is at least about 70% identical 
to SEQ ID NO:2; f) a sequence encoding an amino acid 
. sequence that is at least about 80% similar to SEQ ID 
NO: 2; g) a sequence encoding an amino acid sequence that 
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is at least about 40% similar to residues 1-76, 613-669 
or 1144-1161 of .SEQ ID N0:2; and h) a sequence 
hybridizing at moderate stringency to a sequence encoding 
residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. A 
5 polypeptide, produced by expression of the above listed 
sequences is also provided* 

According to another aspect of the invention, 
an isolated: plant p-glycoprotein, which is inducible upon 
exposure. of the plant to NPPB, is provided. The 

10 polypeptide preferably confers upon a cell in which it is 
found resistance to Rhodamine 6G. The polypeptide is 
preferentially produced in roots upon the exposure to the 
NPPB, The polypeptide is preferrably from Brassica napus 
or AraJbidopsis thaliana. In most preferred embodiments, 

15 the polypeptide has a sequence that is a) an amino acid 
sequence that is at least 80% similar to SEQ ID NO: 2; b) 
an amino acid sequence that is at least 70% identical to 
SEQ ID NO: 2; c) an amino acid sequence that is at least 
40% similar to residues 1-76, 613-669 or 1144-1161 of SEQ 

20 ID NO: 2; and d) an amino acid sequence encoded by a 

nucleic acid sequence hybridizing at moderate stringency 
to a amino acid sequence encoding residues 1-76, 613-669 
or 1144-1161 of SEQ ID N0:2. 

According to other aspects of the invention, 

25 antibodies immunologically specific for the polypeptides 
of the invention are provided, that immunologically 
specific to any of the polypeptides, of polypeptide 
encoded by the nucleic acids of the invention. In a 
preferred embodiment, the antibody is immunospecif ic to 

30 residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. 

According to another aspect of the invention, a 
plant p-glycoprotein gene promoter, which is inducible by 
NPPB, is also provided. In a preferred embodiment, the 
promoter is part or all of residues 1-3429 of SEQ ID 
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N0:10. 

According to another aspect of the invention, 
plants that have reduces levels of plPAC protein are 
provided. In a preferred embodiment/ these plants have 
mutations in the plPAC gene, and in a particularly 
preferred embodiment, the plPAC gene is mutated due to 
the insertion of a T-DNA. Also provided with this aspect, 
is a method for selecting plants with mutations in plPAC 
using SEQ ID NOS: 11-14 as PCR primers. 

These and other features and advantages of the 
present invention will be described in greater detail in 
the description and examples set forth below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. Amino acid sequence lineup of ATPAC 
deduced amino acid sequence and the amino acid sequences 
of related mammalian and plant genes. The lineup shows 
the ATPAC deduced amino acid sequence (SEQ ID NO: 2) 
compared with (1) hmdrl (SEQ ID NO: 3); "(2) mmdrl (SEQ ID 
NO: 4); (3) hmdr3 (SEQIDN0:5); (4) mmdr2 (SEQ ID NO:6); 
(5) atpgpl (SEQ ID N0:7); and (6) atpgp2 (SEQ ID N0:8). 
A consensus sequence (SEQ ID NO: 9) is also shown. 

Figure 2* Graph depicting the effect of 
rhodamine 6G on the growth rate of cells transformed with 
and expressing ATPAC as compared with control cells not 
containing ATPAC. 

Figure 3. Restriction map of genomic clone of 
ATPAC, SEQ ID NO: 10, 

Figure 4. Restriction map of cDNA clone of 
ATPAC, SEQ ID NO:l. 

DETAILED DESCRIPTION OF THE INVENTION 
I. Definitions 

Various terms relating to the biological 
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molecules of the present invention are used hereinabove 
and also throughout the specification and claims. 

With reference to nucleic acids of the 
invention, the term "isolated nucleic acid" is sometimes 
used. This term, when applied to DNA, refers to a DNA 
molecule that is separated from sequences with which it 
is immediately contiguous (in the 5' and 3 1 directions) 
in the naturally occurring genome of the organism from 
which it was derived. For example, the "isolated nucleic 
acid" may comprise a DNA molecule inserted into a vector, 
such as a plasmid or virus vector, or integrated into the 
genomic DNA of a procaryote or eucaryote. An "isolated 
nucleic acid molecule" may also comprise a cDNA molecule. 

With respect to RNA molecules of the invention 
the term "isolated nucleic acid" primarily refers to an 
RNA molecule encoded by an isolated DNA molecule as 
defined above. Alternatively, the term may refer to an 
RNA molecule that has been sufficiently separated from 
RNA molecules with which it would be associated in its 
natural state (i.e., in cells or tissues), such that it 
exists in a "substantially pure" form (the term 
"substantially pure" is defined below) . 

Nucleic acid sequences and amino acid sequences 
can be compared using computer programs that align the 
similar sequences of the nucleic or a m ino acids thus 
define the differences. For purposes of this invention, 
the DNAStar program (DNAStar, Inc., Madison, Wisconsin) 
and the default parameters used by that program are the 
parameters intended to be used herein to compare sequence 
identity and similarity. Alternately, the Blastn and 
Blastp 2.0 programs provided by the National Center for 
Biotechnology Information (at 

http: //www. ncbi.nlm. nih.gov/blast/ ; Altschul et al., 
1990, J Mol Biol 215:403-410) using a gapped alignment 
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with default parameters, may be used to determine the 
level of identity and similarity between nucleic acid 
sequences and amino acid sequences. 

The term "substantially the same" refers to 
5 nucleic acid or amino acid sequences having sequence 

variation that do not materially affect the nature of the 
protein (i.e. the structure/ thermostability 
characteristics and/or biological activity of the 
protein) . With particular reference to nucleic acid 

10 sequences, the term "substantially the same" is intended 
to refer to the coding region and. to conserved sequences 
governing expression, and refers primarily to degenerate 
codons encoding the same amino acid, or alternate codons 
encoding conservative substitute amino acids in the 

15 encoded polypeptide. With reference to amino acid 
sequences, the term "substantially the same" refers 
generally to conservative substitutions and/or variations ° 
in regions of the polypeptide not involved in 
determination of structure or function. 

20 The terms "percent identical" and "percent 

similar" are also used herein in comparisons among amino 
acid and nucleic acid sequences. When referring to amino 
acid sequences, * percent identical" refers to the percent 
of the amino acids of the subject amino acid sequence 

25 that have been matched to identical amino acids in the 
compared amino acid sequence by a sequence analysis 
program. % Percent similar" refers to the percent of the 
amino acids of the subject amino acid sequence that have 
been matched to identical or conserved ami, no acids . 

30 Conserved amino acids are those which differ in structure 
but are similar in physical properties such that the 
exchange of one for another would not appreciably change 
the tertiary structure of the resulting protein. 
Conservative substitutions are defined in Taylor (1986, 
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J. Theor. Biol. 119:205). When referring to nucleic acid 
molecules, m percent identical" refers to the percent of 
the nucleotides of the subject nucleic acid sequence that 
have been matched to identical nucleotides by a sequence 
5 analysis program. 

With respect to protein, the term "isolated 
protein" or "isolated and purified protein" is sometimes 
used, herein. This term refers primarily to a protein 
produced by expression of an isolated nucleic acid 
10 molecule of the invention. Alternatively, this term may 
refer to a protein which has been sufficiently separated 
from other proteins with which it would naturally be 
associated, so as to exist in "substantially pure" form. 
The term "substantially pure" refers to a 
15 preparation comprising at least 50-60% by weight the 
compound of interest (e.g., nucleic acid, 
oligonucleotide, protein, etc.). More preferably, the 
preparation comprises at least 75% by weight, and most 
preferably 90-99% by weight, the compound of interest. 
20 Purity is measured by methods appropriate for the 

compound of interest (e.g. chromatographic methods, 
agarose or polyacrylamide gel electrophoresis, HPLC 
analysis, and the like) . 

With respect to antibodies of the invention, 
25 the term "immunologically specific" refers to antibodies 
that bind to one or more epitopes of a protein of 
interest, but which do not substantially recognize and 
bind other molecules in a sample containing a mixed 
population of antigenic biological molecules. 
30 With respect to oligonucleotides, the term 

"specifically hybridizing" refers to the association 
between two single-stranded nucleotide molecules of 
sufficiently complementary sequence to permit such 
hybridization under pre-determined conditions generally 
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used in the art (sometimes termed "substantially 
complementary") . In particular, the term refers to 
hybridization of an oligonucleotide with a substantially 
complementary sequence contained within a single-stranded 
DNA or RNA molecule of the invention, to the substantial 
exclusion of hybridization of the oligonucleotide with 
single-stranded nucleic acids of non-complementary 
sequence. 

The term * expression cassette", as used herein, 
comprises 5 1 and 3' regulatory regions operably linked to 
a coding sequence. The coding sequence may be in the 
sense or antisense orientation with respect to the 5' 
regulatory region. 

the terra "promoter region" refers to the 5' 
regulatory regions of a gene. 

The term * reporter gene" refers to genetic 
sequences which may be operably linked to a promoter 
region forming a transgene, such that expression of the 
reporter gene coding region is regulated by the promoter 
and expression of the transgene is readily assayed. 

The term "selectable marker gene" refers to a 
gene product that when expressed confers a selectable 
phenotype, such as antibiotic resistance, on a 
transformed cell or plant. 

The term "operably linked" means that the 
regulatory sequences necessary for expression. of the 
coding sequence are placed in the DNA molecule in the 
appropriate positions relative to the coding sequence so 
as to effect expression of the coding sequence. This 
same definition is sometimes applied to the arrangement 
of coding sequences and transcription control elements 
(e.g. promoters, enhancers, and termination elements) in 
an expression vector. 

The term "DNA construct" refers to genetic 
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sequence used to transform plants and generate progeny 
transgenic plants. These constructs may be. administered 
to plants in a viral or plasmid vector. Other methods of 
delivery such as Agrobacterium T-DNA mediated 
5 transformation and transformation using the biolistic 
process are. also contemplated to be within the scope of 
the present invention.- The transforming DNA may be 
prepared according to standard protocols such as those 
set forth in "Current Protocols in Molecular Biology" , 
10 eds. Frederick M. Ausubel et al., John Wiley & Sons, 
1995. 

The term % xenobiotic" refers to foreign 
chemicals or agents not produced or naturally found in 
the organism. The term is commonly used in reference to 
15 toxic or otherwise detrimental foreign chemicals, such as 
organic pollutants or heavy metals. 



20 



II. Description of plPAC 

and its Encoded Polveptide 



In accordance with the present invention, a 
nucleic acid encoding a novel ATP-binding-cassette (ABC) 
transporter has been isolated and cloned from plants. 
The nucleic acid is referred to herein as plPAC. 

25 A cDNA clone of the plPAC from Arabidopsis 

thaliana, an exemplary plPAC of the invention, is 
described in detail herein and its nucleotide sequence is 
set forth in Example 1 as SEQ ID NO:l. This nucleic acid 
molecule is referred to as *ATPAC . It is 36% identical 

30 and 51% similar to human mdrl across the entire sequence. 
It is 51% identical to the atpgpl gene reported by Dudler 
& Hertig (1997, supra) and 50% identical to atpgp2, a 
close homolog of atpgpl, published in the Genbank . 
database. ATPAC protein is 65% similar to atpgpl and 

35 atpgp2 proteins. 
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A partial clone of a plPAC of the invention was 
originally isolated from Brassica napus via differential 
expression, screening of plants grown in the presence or 
absence of the chloride channel blocker/ 5-nitro-2- (3- 
phenylpropylamino) benzoic acid (NPPB) . A 0.5 kb gene 
fragment was identified, which had been up-regulated in 
response to NPPB treatment. This cDNA fragment was used . 
to screen an Arabidopsis cDNA library, from which the 
complete ATPAC clone was isolated. The isolation and 
characterization of ATPAC is described in Example 1. 

A genomic clone of ATPAC (SEQ ID NO: 10) has 
also been isolated from a bacterial artificial chromosome 
(BAC) library of the Arabidopsis genome (BAG clone IGF 
F3J22, obtained from the Arabidopsis stock center, Ohio 
State University) . A 7 kb fragment containing part of 
ATPAC and additional 5* regulatory sequences was 
subcloned into a plasmid vector (pBluescript) . A 
restriction map of ATPAC is found in Fig. 3. The 
corresponding cDNA clone of ATPAC is found in SEQ ID NO:l 
and its restriction map is Fig. 4. 

Among the unique features of this nucleic acid 
molecule as compared with other mdr-like genes from 
plants are its inducibility by certain compounds, 
including NPPB and herbicides, and its preferential 
expression in roots. The promoter regulatory region of 
ATPAC comprises residues 1-3429 of SEQ ID NO: 10. 

Although the ATPAC cDNA clone from Arabidopsis 
thaliana is described and exemplified herein, this 
invention is intended to encompass nucleic acid sequences 
and proteins from other plant species that are 
sufficiently similar to be used instead of ATPAC nucleic 
acid and proteins for the purposes described below. 
These include, but are not limited to, allelic variants 
and natural mutants of SEQ ID NO:l, which are likely to 
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be found in different species of plants or varieties of 
Arabidopsis. 

Because such variants are expected to possess 
certain differences in nucleotide and amino acid 
5 sequence, this invention provides an isolated plPAC 
nucleic acid molecule having at least about 60% 
(preferably 70% and more preferably over 80%) sequence 
homology in the coding regions with the nucleotide 
sequence set forth as SEQ ID NO:l or SEQ ID NO: 10 (and, 

10 most preferably, specifically comprising the coding 

region of SEQ ID N0:1) . Also provided are nucleic acids 
that encode a polypeptide that is at least about 40% 
(preferably 50% and most preferably 60%) similar to 
residues 1-76, 613-669 or 1144-1161 of SEQ ID NO:2. Also 

15 provided are nucleic acids that hybridize to the nucleic 
acids of SEQ ID NO:l, SEQ ID NO: 10, or nucleic acids 
encoding the regions of residues 1-76, 613-669 or 1144- 
1161 of SEQ ID NO: 2, preferably under moderate stringency 
(more preferably, high stringency, and most preferably, 

20 very high stringency) . 

In other preferred embodiments, the nucleic 
acids have a restriction digest map that is identical for 
at least 3 enzymes (more preferably 6 enzymes and most 
preferably ,9 enzymes) to the maps shown in Figs. 3 or 4. 

25 In another preferred embodiment, the nucleic acids have a 
restriction digest map identical to those shown in Fig. 3 
for enzymes Xhol, Xcml and Spel (preferably additionally 
Sad, Pad and Bsal, and most preferably additionally 
Acil, BanI and SnaBI) . In another preferred embodiment, 

30 the nucleic acids have a restriction digest map identical 
to those shown in Fig* 4 for enzymes Xbal, TatI and Neil 
. (preferably additionally Dral, BsmI and Bcil, and most 
preferably additionally AccI, Bsgl and TJil) . The 
nucleic acids of the invention are at least 20 nucleic 
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acids in length (preferably at least 50 nucleic acids and 
most preferably at least 100 nucleic acids) . . 

In accordance with the invention, novel plPAC 
genes from two plant species, Brassica napus and 
5 Arabidopsis thaliana, are presented. This constitutes 
the first description of this unique p-glycoprotein in 
plants. Indeed, the closest known protein sequence, also, 
from Arabidopsis, is only 65% identical suggesting that 
the ATPAC gene is novel and is expected to have novel 

10 properties. The isolation of two plPAC genes from 

different species enables the isolation of further plPAC 
genes from other plant species. Isolated .nucleic acids 
that are plPAC genes from any plant species are 
considered part of the instant invention. In particular, 

15 the nucleic acids of other plPAC genes -can be isolated 
using sequences of ATPAC that distinguish plPAC genes 
from other plant mdr genes according to methods that are 
well known to those in the art of gene isolation. In 
particular, sequences that encode residues 1-76, 613-669 

20 and 1144-1161 of SEQ ID N0:2 can be used. In a preferred 
embodiment, the plPAC gene is from any higher plant 
species (more preferred from a dicot species, and most 
preferred from a species in Brassicaceae (or 
Crucif erae) ) . 

25 . This invention also provides isolated 

polypeptide products of the open reading frames of SEQ ID 
NO:l or SEQ ID NO: 10, having at least about 70% 
(preferably 80% and most preferably 90%) sequence 
identity, or at least about 80% similarity (preferably 

30 90% and more preferably 95%) with the amino acid sequence 
of SEQ ID NO: 2. In another embodiment, the polypeptides 
of the invention are at least about 40% identical 
(preferably 50%, and most preferably 60%) to the regions 
of residues 1-7-6, 613-669 or 1144-1161 of SEQ ID NO: 2. 
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Because of the natural sequence variation likely to exist 
among plPAC genes, one skilled in the art would expect to 
find up to about 30-40% nucleotide sequence variation, 
while still maintaining the unique properties of the 
5 plPAC gene and encoded polypeptide of the present 

invention. Such an expectation is due in part to the 
degeneracy of the genetic code, as well as to the known 
evolutionary success of conservative amino acid sequence 
variations, which do not appreciably alter the nature of 

10 the encoded protein. Accordingly, such variants are 

considered substantially the same as one another and are 
included within the scope of the present invention* 

Also provided are transgenic plants transformed 
with part or all of the nucleic acids of the invention. 

15 Transgenic plants that over-express a plPAC coding 
sequence are one embodiment of this aspect of the 
invention. Example 3 provides for one prototype of such 
a plant. In a preferred embodiment, the ATPAC gene is 
used, and in a most preferred embodiment SEQ ID NO:l or 

20 SEQ ID NO: 10 is used. The plPAC gene may be placed under 
a powerful constitutive promoter, such as the Cauliflower 
Mosaic Virus (CaMV) 35S promoter or the figwort mosaic 
virus 35S promoter. In a preferred embodiment, the 
35SCaMV promoter is used. Transgenic plants expressing 

25 the plPAC gene under an inducible promoter (either its 
own promoter or a heterologous promoter) are also 
contemplated to be within the scope of the present 
invention. Inducible plant promoters include the 
tetracycline repressor /operator controlled promoter. In 

30 a preferred embodiment, a native plPAC promoter is used, 
and in a most preferred embodiment/ residues 1-3429 of 
SEQ ID NO: 10 is used. Plant species that are 
contemplated for overexpression of a plPAC coding 
sequence include, but are not limited to, soybean. 
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In another embodiment, overexpression of plPAC 
is induced to generate a co-suppression effect. This 
excess expression serves to promote down-regulation of 
both endogenous and exogenous plPAC genes. 
5 In some instances, it may be desirable to down- 

regulate or inhibit expression of endogenous plPAC in 
plants possessing the gene. Accordingly, plPAC nucleic 
acid molecules, or fragments thereof, may also be 
utilized to control the production of pJPAC-encoded P- 

10 glycoproteins. In one embodiment, full-length plPAC 
antisense molecules or antisense oligonucleotides, 
targeted to specific regions of pIPAC-encoded RNA that 
are critical for translation, are used. The use of 
antisense molecules to decrease expression levels of a 

15 pre-determined gene is known in the art. In a preferred 
embodiment, antisense molecules are provided in situ by 
transforming plant cells with a DNA construct which, upon 
transcription, produces the antisense sequences. Such 
constructs can be designed to produce full-length or 

20 partial antisense sequences. One example of antisense 
plPAC transgenic plants is given in Example 3. 

In another embodiment, knock-out plants are 
obtained by screening a T-DNA mutagenized plant 
population for insertions in the plPAC gene (see krysan 

25 et al., 1996, PNAS 93:8145). One example of this 
embodiment of the invention is found in Example 3. 
Optionally, transgenic plants can be created containing 
mutations in the region encoding the active site of 
plPAC. These last two embodiments are preferred over the 

30 use of anti-sense constructs due to the high homology 
among P-glycoproteins . 

The promoter of ATPAC is also provided in 
accordance with the invention. This promoter has the 
useful properties of root expression and inducability by 
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NPPB. The prototypic example of this aspect of the 
invention is residues 1-3429 of SEQ ID NO: 10. It is 
anticipated that plPAC genes from other plant species 
will likewise exhibit the aforementioned useful 
properties* As these promoter regions can easily be 
isolated from the plPAC genes that are provided with the 
invention, all plant plPAC gene promoters are provided 
with the invention. The nucleic acids of the invention 
therefore include a nucleic acid molecule that is at 
least about 70% identical (preferably 80% and most 
preferably 90%) to the residues 1-3429 of SEQ ID NO: 10. 
Also provided are nucleic acids that hybridize to the 
nucleic acid residues 1-3429 of SEQ ID NO: 10 preferably 
under moderate stringency (more preferably, high 
stringency, and most preferably, very high stringency) . 

The present invention also provides antibodies 
capable of immuno-specif ically binding to polypeptides of 
the invention. Polyclonal or monoclonal antibodies 
directed toward any of the peptides encoded by plPAC may 
be prepared according to standard methods. Monoclonal 
antibodies may be prepared according to general methods 
of Kdhler and Milstein, following standard protocols. In 
a preferred embodiment, antibodies are prepared, which 
react immuno-specif ically with various epitopes of the 
piPAC-encoded polypeptides. In a preferred embodiment, 
the antibodies are immunologically specific to the 
polypeptide of residues 1-76, 613-669 or 1144-1161 of SEQ 
ID NO: 2. 

The following description sets forth the 
general procedures involved in practicing the present 
invention. To the extent that specific materials are 
mentioned, it is merely for purposes of illustration and 
is not intended to limit the invention. Unless otherwise 
specified, general cloning procedures, such as those set 



WO 00/18886 



- 18 - 



PCI7US99/22363 ' 



forth in Sambrook et al . , Molecular Cloning , Cold Spring 
Harbor Laboratory (1989) (hereinafter "Sambrook et al.") 
or Ausubel et al. (eds) Current Protocols in Molecular 
Biology , John Wiley & Sons (1997) (hereinafter m Ausubel 
5 et al..") are used. 

III. Preparation of P1PAC Nucleic Acid Molecules, 

encoded Polypeptides, Antibodies Specific for the 
Polypeptides and Transgenic Plants 

10 

1. Nucleic Acid Molecules 

PlPAC nucleic acid molecules of the invention 
may be prepared by two general methods: (1) they may be 

15 synthesized from appropriate nucleotide triphosphates, or 
(2) they may be isolated from biological sources. Both 
methods utilize protocols well known in the art. 

The availability of nucleotide sequence 
information/ such as the cDNA having SEQ ID NO:l, enables 

20 preparation of an isolated nucleic acid molecule of the 
invention by oligonucleotide synthesis. Synthetic 
oligonucleotides may be prepared by the phosphoramadite 
method employed in the Applied Biosystems 38A DNA 
Synthesizer or similar devices. The resultant construct 

25 may be purified according to methods known in the art, 
such as high performance liquid chromatography (HP-LC) . 
Long, double-stranded polynucleotides, such as a DNA 
molecule of the present invention, must be synthesized in 
stages, due to the size limitations inherent in current 

30 oligonucleotide synthetic methods. Thus, for example, a 
long double-stranded molecule may be synthesized as 
several smaller segments of appropriate complementarity. 
Complementary segments thus produced may be annealed such 
that each segment possesses appropriate cohesive termini 

35 for attachment of an adjacent segment. Adjacent segments 
may be ligated by annealing cohesive termini in the 
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presence of DNA ligase to construct an entire long 
double-stranded molecule. A synthetic DNA molecule so 
constructed may then be cloned and amplified in an 
appropriate vector, 
5 P1PAC genes also may be isolated from 

appropriate biological sources using methods known in the 
art. In fact, the ATPAC clone was isolated from an 
Arabidopsis cDNA library using a partial clone obtained 
from Brassica napus. In alternative embodiments, genomic 

10 clones of plPAC may be isolated. 

In accordance with the present invention, 
nucleic acids having the appropriate level sequence 
homology with part or all the coding regions of SEQ ID 
N0:1 or SEQ ID NO: 10 may be identified by using 

15 hybridization and washing conditions of appropriate 
stringency. For example, hybridizations may be 
perf ormed, according to the method of Sambrook et al . , 
using a hybridization solution comprising: 5X SSC, 
5X Denhardt's reagent, 1.0% SDS, 100 pg/ml denatured, 

20 fragmented salmon sperm DNA, 0.05% sodium pyrophosphate 

and up to 50% formamide. .Hybridization is carried out at 
37-42°C for at least six hours. Following hybridization, 
filters are washed as follows: (1) 5 minutes at room 
temperature in 2X SSC and 1% SDS; (2) 15 minutes at room 

25 temperature in 2X SSC and 0.1% SDS; (3) 30 minutes-1 

hour at 37 °C in 2X SSC and 0.1% SDS; (4) 2 hours at 45- 
55°in 2X SSC and 0.1% SDS, changing the solution every 30 
minutes. 

One common formula for calculating the 
30 stringency conditions required to achieve hybridization 
between nucleic acid molecules of a specified sequence 
homology (Sambrook et al., 1989): 

T m = 81.5°C + 16.6Log [Na+] + 0.41(% G+C) - 0.63 (% foxmamide) - 600/#bp in duplex 
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As an illustration -of the above formula, using [N+] = 
[0.368] and 50% formamide/ with GC content of 42% and an 
average probe size of 200 bases, the T n is 57°C. The T B 
of a DNA duplex decreases by 1 - 1.5°C with every 1% 
5 decrease in homology. Thus, targets with greater than 
about 75% sequence identity would be observed using a 
hybridization temperature of 42 °C. 

The stringency of the hybridization and wash 
depend primarily on the salt concentration and 

10 temperature of the solutions* In general, to maximize 
the rate of annealing of the probe with its target, the 
hybridization is usually carried out at salt and 
temperature conditions that are 20-25 °C below the 
calculated T B of the of the hybrid. Wash conditions 

15 should be as stringent as possible for the degree of 

identity of the probe for the target. In general, wash 
conditions are selected to be approximately 12-20°C below 
the T B of the hybrid. In regards to the nucleic acids of 
the current invention, a moderate stringency 

20 hybridization is defined as hybridization in 6X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 ug/ml denatured 
salmon sperm DNA at 42 °C, and wash in 2X SSC and 0.5% SDS 
at 55°C for 15 minutes. A high stringency hybridization 
is defined as hybridization in 6X SSC, 5X Denhardt's 

25 solution, 0.5% SDS and 100 jig/ml denatured salmon sperm 
DNA at 42°C, and wash in IX SSC and 0.5% SDS at 65°C for 
15 minutes. A very high stringency hybridization is 
defined as hybridization in 6X SSC, 5X Denhardt's 
solution, 0.5% SDS and 100 pg/ml denatured salmon sperm 

30 DNA at 42°C, and wash in 0.1X SSC and 0.5% SDS at 65°C 
for 15 minutes. 

Nucleic acids of the present invention may be 
maintained as DNA in any convenient cloning vector. In a 
preferred embodiment, clones are maintained in plasmid 
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cloning/expression vector, such as pGEM-T (Promega 
Biotech, Madison, WI) or pBluescript (Stratagene, La 
Jolla, CA) , either of which is propagated in a suitable 
E. coli host cell. 
5 P1PAC nucleic acid molecules of the invention 

include cDNA, genomic DNA, RNA, and fragments thereof 
which may be single- or double-stranded. Thus, this 
invention provides oligonucleotides (sense or antisense 
strands of DNA or RNA) having sequences capable of 

10 hybridizing with at least one sequence of a nucleic acid 
molecule of the present invention, such as selected 
segments of SEQ ID N0:1 or SEQ ID NO:10. Such 
oligonucleotides are useful as probes for detecting plPAC 
genes or mRNA in test samples, e.g. by PCR amplification, 

15 mapping of genes or for the positive or negative 

regulation of expression of plPAC genes at or before 
translation of the mRNA into proteins. 

The plPAC promoter is also expected to be 
useful in connection with the present invention, inasmuch 

20 as it is inducible in plants upon exposure to anion 

channel blockers. As mentioned above, seven-kilobase 
fragment of genomic DNA has been isolated, which contains 
part or all of the plPAC promoter from Arabidopsis 
thaliana. This promoter can be used in chimeric gene 

25 constructs to facilitate inducible expression of any 
coding sequence of interest, upon exposure to NPPB or 
similar-acting compounds. 

2. Proteins 

30 Polypeptides encoded by plPAC nucleic acids of 

the invention may be prepared in a variety of ways, 
according to known methods. If produced in situ the 
polypeptides may be purified from appropriate sources, 
e.g., plant roots or other plant parts. 
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Alternatively, the availability of nucleic acid 
molecules encoding the polypeptides enables production of 
the proteins using in vitro expression methods known in 
the art. For example; a cDNA or gene may be cloned into 
5 an appropriate in vitro transcription vector, such a 
pSP64 or pSP65 for in vitro transcription, followed by 
cell-free translation in a suitable cell-free translation - 
system, such as wheat germ or rabbit reticulocytes. In 
vitro transcription and translation systems are 

10 commercially available, e.g., from Promega Biotech, 
Madison, Wisconsin or BRL, Rockville, Maryland. 

According to a preferred embodiment, larger 
quantities of plPAC-encoded polypeptide may be produced 
by expression in a suitable procaryotic or eucaryotic 

15 system. For example, part or all of a DNA molecule, such 
as the cDNA having SEQ ID NO:l, may be inserted into a 
plasmid vector adapted for expression in a bacterial cell 
{such as E. coli) or a yeast cell (such as Saccharomyces 
cerevisiae) , or into a baculovirus vector for expression 

20 in an insect cell. Such vectors comprise the regulatory 
elements necessary for expression of the DNA in the host 
cell, positioned in such a manner as to permit expression 
of the DNA in the host cell. Such regulatory elements 
required for expression include promoter sequences, 

25 transcription initiation sequences and, optionally, 
enhancer sequences. 

The plPAC polypeptide produced by gene 
expression in a recombinant procaryotic or eucyarotic 
system may be purified according to methods known in the 

30 art. In a preferred embodiment, . a commercially available 
expression/secretion system can be used, whereby the 
recombinant protein is expressed and thereafter secreted 
from the host cell, to be easily purified from the 
surrounding medium. If expression/secretion vectors are 
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not used, an alternative approach involves purifying the 
recombinant protein by affinity separation, such as by 
immunological interaction with antibodies that bind 
specifically to the recombinant protein. Such methods 
5 are commonly used by skilled practitioners. 

The plPAC-encoded polypeptides of the 
invention, prepared by the aforementioned methods, may be . 
analyzed according to standard procedures . 

10 C. Transgenic Plants 

Transgenic plants expressing the plPAC gene can 
be generated using standard plant transformation methods 
known to those skilled in the art. These include, but 
are not limited to, Agrobacterium vectors, PEG treatment 

15 of protoplasts, biolistic DNA delivery, UV laser 

microbeam, gemini virus vectors, calcium phosphate 
treatment of protoplasts, electroporation of isolated 
protoplasts, agitation of cell suspensions with 
microbeads coated with the transforming DNA, direct DNA 

20 uptake, liposorae-mediated DNA uptake, and the like. Such 
methods have been published in the art. See, e.g., 
Methods for Plant Molecular Biology (Weissbach & 
Weissbach, eds., 1988) ; Methods in Plant Molecular 
Biology (Schuler & Zielinski, eds., 1989); Plant 

25 Molecular Biology Manual (Gelvin, Schilperoort, Verma, 
eds., 1993); and Methods in Plant Molecular Biology - A 
Laboratory Manual (Maliga, Klessig, Cashmore, Gruissem & 
Varner, eds., 1994). 

The method of transformation depends upon the 

30 plant to be transformed. The biolistic DNA delivery 

method is useful for nuclear transformation. In another 
embodiment of the invention, Agrobactexium vectors are 
used to advantage for efficient transformation of plant 
nuclei . 
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In a preferred embodiment/ the gene is 
introduced into plant nuclei in Agrobacterium binary 
. vectors- Such vectors include , but are not limited to, 
BIN19 (Bevan, 1984, Nucleic Acid Res 12: 8711-8721) and 
5 derivatives thereof, the pBI vector series (Jefferson et 
al., 1987, PNAS 83:8447-51), and binary vectors pGA482 
and pGA492 (An, 1986) and others (for review, see An, 
1995, Methods Mol Biol 44:47-58). In preferred 
embodiments, the pPZP211 vector (Hajdukiewicz et al,, 

10 1994, PMB 25:989-994) or PCGN7366 (Calgene, CA) are used. 
DNA constructs for transforming a selected plant comprise 
a coding sequence of interest operably linked to 
appropriate 5' (e.g., promoters and translational 
regulatory sequences) and 3' regulatory sequences (e.g., 

15 . terminators). 

Using an Agrobacterium binary vector system for 
transformation, the plPAC coding region, under control of 
a constitutive or inducible promoter as described above, 
is linked to a nuclear drug resistance marker, such as 

20 kanamycin resistance. Agrobacterium-mediated 

transformation of plant nuclei is accomplished according 
to the following procedure: 

(1) the gene is inserted into the selected 
Agrobacterium binary vector; 

25 (2) transformation is accomplished by co- 

cultivation of plant tissue (e.g., leaf discs) with a 
suspension of recombinant Agrobacterium, followed by 
incubation (e.g., two days) on growth medium in the 
absence of the drug used as the selective medium (see, 

30 e.g., Horsch et al. 1985, Cold Spring Harb Symp Quant 
Biol. 50:433-7); 

(3) plant tissue is then transferred onto the 
selective medium to identify transformed tissue; and 

(4) identified transformants are regenerated 
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to intact plants. 

It should be recognized that the amount of 
expression, as well as the tissue specificity of 
expression of the plPAC gene in transformed plants can 
5 vary depending on the position of their insertion into 

the nuclear genome. Such position effects are well known 
in the art. For this reason, several nuclear 
transformants should be regenerated and tested for 
expression of the transgene. 

10 

IV. Uses of PlPAC Nucleic Acids, Encoded Proteins 
and Antibodies 

1 . PlPAC Nucleic Acids 

15 PlPAC nucleic acids may be used for a variety 

of purposes in accordance with the present invention. 
The DNA, RNA, or fragments thereof may be used as probes 
to detect the presence . of and/or expression of plPAC 
genes. Methods in which plPAC nucleic acids may be 

20 utilized as probes for such assays include, but are not 
limited to: (1) in situ hybridization; (2) Southern 
hybridization (3) northern hybridization; and (4) 
assorted amplification reactions such as polymerase chain 
reactions (PCR) . 

25 The plPAC nucleic acids of the invention may 

also be utilized as probes to identify related genes from 
other plant species. As is well known in the art and 
described above, hybridization stringencies may be 
adjusted to allow hybridization of nucleic acid probes 

30 with complementary sequences of varying degrees of 
homology. Thus, plPAC nucleic acids may be used to 
advantage to identify and characterize other genes of 
varying degrees of relation to the exemplary ATP AC, 
thereby enabling further characterization of this family 
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of genes in plants. Additionally, they may be used to 
identify genes encoding proteins that interact with the 
P-glycoprotein encoded by plPAC (e.g., by the 
"interaction trap" technique) . 

2. PlPAC Proteins and Antibodies 

Purified pi PAC- encoded P-glycoproteins, or 
fragments thereof, may be used to produce polyclonal or 
monoclonal antibodies which also may serve as sensitive 
detection reagents for the presence and accumulation of 
plant P-glycoproteins in cultured plant cells or tissues 
and in intact plants* Recombinant techniques enable 
expression of fusion proteins containing part or all of 
the plPAC-encoded protein. The full length protein or 
fragments of the protein may be used to advantage to 
generate an array of monoclonal or polyclonal antibodies 
specific for various epitopes of the protein, thereby 
providing even greater sensitivity for detection of the 
protein in cells or tissue-. 

Polyclonal or monoclonal antibodies 
immunologically specific for plPAC-encoded proteins may 
be used in a variety of assays designed to detect and 
quantitate the protein. Such assays include, but are not 
limited to: (1) flow cytometric analysis; (2) 
immunochemical localization in cultured cells or tissues; 
and (3) immunoblot analysis (e.g., dot blot. Western 
blot) of extracts from various cells and tissues. 

Polyclonal or monoclonal antibodies that 
immunospecif ically interact with one or more of the 
polypeptides encoded by plPAC can be utilized for 
identifying and purifying such proteins. For example, 
antibodies may be utilized for affinity separation of 
proteins with which they immunospecif ically interact. 
Antibodies may also be used to immunoprecipitate proteins 
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from a sample containing a mixture of proteins and other 
biological molecules, 

3. plPAC Transgenic Plants 
5 Transgenic plants that over- or under- express 

plPAC can be used in a varied of agronomic and research 
applications. From the foregoing discussion, it can be 
seen that plPAC and its homologs, and transgenic plants 
containing them will be useful for improving stress 

10 resistance or tolerance in plants. This provides an 

avenue for developing marginal or toxic soil environments 
for crop production. Both over- and under-expressing 
plPAC transgenic plants have great utility in the 
research of herbicides and other xenobiotic compounds. 

15 As discussed above and in greater detail in 

Example 1/ the similarity between plant and mammalian mdr 
genes indicates that their functional aspects will also 
be conserved. Thus, plPAC is expected to play an 
important, role in the exclusion of toxic metabolic or 

20 xenobiotic compounds from cells. The fact that plPAC 
also is inducible and appears to be preferentially 
expressed in roots, where contact with such compounds 
often occurs, makes plPAC particularly desirable for 
genetic engineering of plants to increase their tolerance 

25 to such compounds . Accordingly, plants engineered to 

overexpress the plPAC gene should be resistant to a wide 
range of chemicals, both intentionally applied as 
herbicides or unintentionally as wastes. Examples of the 
kinds of xenobiotics that should be detoxified by the 

30 plPAC of the invention include, but are not. limited to, 
hydrophobic (i.e., lipophilic) herbicides and other 
compounds, such as 3 (3, 4-dichlorophenyl) -1, 1, dimethyl 
urea (also known as DCMU or Diuron, available from Sigma 
Chemical Co., St. Louis, MO) or other hydrophobic 
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compounds that disrupt photosynthetic electron transport, 
as well as Metachlor (Ciba Geigy, Basel Switzerland) , 
Taurocholate (Sigma Chemical Co. ) , Primisulfuron (Ciba 
Geigy), and IRL-1803. 

As illustrated in Example 2, plant cells that 
over-express a plPAC. gene have . surprisingly higher growth 
rate with or without the xenobiotic compound Rhodamine 
6G. It is contemplated that plPAC overexpression may be 
a generally useful way to increase plant and plant cell 
culture growth, even without the presence of xenobiotic 
compounds. 

The following specific examples are provided to 
illustrate embodiments of the invention. They are not 
intended to limit the scope of the invention in any way. 



EXAMPLE 1 
Cloning and Analysis of a 
PlPAC From AraJbidopsis thallana 

The plPAC of the present invention was 
identified by its up-regulation in response to a chloride 
ion channel blocker. Brassica napus plants were grown 
either in the presence or absence of 20 yM 5-nitro-2- (3- 
phenylpropyl amino) benzoic acid (NPPB) . After five days, 
the roots of the seedlings were harvested and total RNA 
was extracted separately from the treated and untreated 
plants- From the total RNA preparations, poly (A)+ RNA 
was isolated and used as the starting material to create 
a cDNA subtraction library, using the CLONTECH PCR- 
S ELECT 1 * cDNA Subtraction Kit and accompanying 
instructions (CLONTECH Laboratories, Inc., Palo Alto, 
CA) . 
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Using the subtractive hybridization kit, a gene 
fragment was identified that was up-regulated in response 
to treatment of the plants with NPPB. This fragment (0.5 
kb) was used to screen a cDNA. library of Arabidopsis 
5 thaliana, from which a full-length cDNA clone was 

isolated.. The nucleotide sequence of this cDNA clone, 
referred to as ATPAC {Arabidopsis thaliana putative anion - 
channel) is set forth below as SEQ ID NO:l. 

The 3.76 kb cDNA clone encodes a polypeptide 

10 1,254 amino acids in length. The deduced amino acid 

sequence encoded by SEQ ID NO:l is shown in Figure 1 as 
m atpac" (SEQ ID NO:2), in a lineup with the following 
sequences: (1) hmdrl (SEQ ID NO:3); (2) mmdrl (SEQ ID 
NO: 4); (3) hmdr3 (SEQ ID NO: 5); (4) mmdr2 (SEQ ID NO: 6); 

15 (5) atpgpl (SEQ ID N0:7); and (6) atpgp2 (SEQ ID N0:8). 
A consensus sequence (SEQ ID NO: 9) is also shown. 

A search of various sequence databases 
indicates that ATPAC is a new and distinct member of the 
mdr family of ABC transporters. In none of the 

20 databases, including the EST collection, does an exact 
match exist. The ABC transporter family is very large, 
consisting of at least two sub-groups, mzp and homologs 
and mdr and homologs. The only examples of plant mdr- 
like genes are atpgpl and atpgp2 from A. thaliana and two 

25 homologs from potato and barley, respectively. Though 
the atpgpl and atpgp2 genes are -similar to ATPAC, they 
are only 51 and 50% identical, respectively, indicating 
that ATPAC is a distinct gene by comparison. Sequence 
homology with the potato and barley mdr-like genes is 

30 even more divergent. Another difference between the 
agpgpl gene and the ATPAC gene is their respective 
preferential expression in inflorescens and roots, 
respectively. 
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EXAMPLE 2 

Effect of ATPAC Expression in Bacterial Cells 
on Their Ability to Detoxify Rhodamine 6G 

The compound Rhodamine 6G is a well known 
substrate of mammalian p-glycoproteins (Kolaczkowski et 
al., J. Biol. Chem. 271: 31543-31548, 1996). The ability 
of a cell to detoxify the compound is indicative of 
activity of p-glycoproteins. A bacterial cell line was 
transformed with an expression vector comprising ATPAC< 
The growth rate of transformed and non-transformed cells 
was then measured/ in the presence or absence of 
Rhodamine 6G. Results are shown in Figure 2. As can be 
seen, ATPAC-expressing cells grown in the absence of the 
drug had the best growth rate. Moreover, even in the 
presence of the drug, the cells grew more quickly than 
non-transformed cells in the presence or absence of 
Rhodamine 6G. These results demonstrate that ATPAC 
encodes a functional and robust p-glycoprotein. 

Example 3 
Transgenic Plants the Overexpress 
and Underexpress ATPAC 

Transformation construct. The Agrobacterium 
binary vector pPZP211 (Hajdukiewicz et al., 1994 Plant 
Mol. Biol. 25:989-994) was digested with EcoFl and Smal, 
and self-ligated. This molecule was named pPZP211'. The 
Agrobacterium binary vector pCGN7366 {Calgene, CA) was 
digested with Xhol and cloned in Sail-digested pPZP211'. 
We named this binary vector pPZP-PCGN. The 3.8 kb full- 
length ATPAC cDNA was cloned into the pGH19 vector. 
After digestion with Smal (in the multiple cloning site 
upstream) and EcoRX, a 3.1 kb cDNA fragment was cut out. 
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This Smal-EcoRI 3.1 kb fragment was cloned into the 
Smal/EcoRI site of pPZP-pCGN. The rest of ATPAC gene was 
amplified using polymerase chain reaction to have 
translationally fused HA-tag at its 3' -terminal. After 
5 ligating £co<Rl linkers to the ends of the resulting PCR 
product, the 0.7 kb fragment was cloned into the EcoKI 
site of the Smal-EcoRI 3.1 kb ATPAC fragment in pPZP- 
pCGN. The final construct was named pATPAC-OE. 

Plant transformation. pATPAC-OE was introduced 

10 into Agrobacterium tumefaciens strain by a direct 
transformation method. Agrobacteri um-medi a ted 
transformation was performed using vacuum infiltration 
(Bechtold et al., 1993,. CR Acad. Sci. [Ill] 316: 
1194-1199.) 

15 Tl plants which survived on 

kanamycin-containing plates were selected, transplanted 
into soil and grown to set T2 seed. T3 seeds were 
collected from kanamycin-resistant T2 plants. T3 plants 
which showed 100% kanamycin-resistance were selected and 

20 were considered homozygous for the transgene. 

Antisense Plants. The full length cDNA in 
pBluescript SK(-) vector (Stratagene, CA) is digested 
with EcoRI (there is a cleavage site in the upstream 

25 polylinker) and Sspl. The resulting 1.3 Kb fragment 

representing a 5' portion of the AtPAC cDNA was cloned 
into the aforementioned pPZP-PCGN, which had been 
digested with EcoEl/Smal, ensuring that this fragment of 
the cDNA was inserted in the antisense orientation. This 

30 construct was named pATPAC-AE. pATPAOAE was introduced 
into Arabidopsis plants by Agrobacterium transformation, 
as described above. 
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Knock-out Plants. The method of Krysan et al 
(1996, PNAS 93:8145, incorporated by reference herein) 
was followed using the following primers: 
Gene-specific primers: 
5 AtpacF: GACTGCTCAATGATCTCGTTTTCTCACTA (SEQ ID NO: 11) 
AtpacR: CTTGAATCACACCAATGCAATCAACACCTC (SEQ ID NO: 12) 
Primers for T-DNA left boarder: 

JL202: CATTTTATAATAACGCTGCGGACATCTAC (SEQ ID NO: 13) 
JL270: TTTCTCCATATTGACCATCATACTCATTG (SEQ ID NO: 14) 

10 

While certain of the preferred embodiments of 
the present invention have been described and 
specifically exemplified above, it is not intended that 
the invention be limited to such embodiments. Various 
15 modifications may be made thereto without departing from 
the scope and spirit of the present invention, as set 
forth in the following claims. 
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What is claimed: 

1. A nucleic acid isolated from a plant, which 
encodes a p-glycoprotein that is inducible by exposure of 

r. 5 the plant to NPPB. 

2. The isolated nucleic acid of claim 1, which . 
is preferentially expressed in plant roots upon exposure 
of the plant to NPPB. 

10 

3. The isolated nucleic acid of claim 1, 
wherein the plant is selected from the group consisting 
of Brassica napus and Arabidopsis thaliana and is 3850- 
4150 nucleotides long. 

15 

4. The isolated nucleic acid of claim 1, which 
has the restriction sites shown in Figure 4 for at least 
three enzymes. 

20 5. The isolated nucleic acid of claim 4, which 

encodes a polypeptide having SEQ ID NO: 2. 

6. The isolated nucleic acid of claim 5, which 
is a cDNA comprising a coding region selected from the 

25 group consisting of SEQ ID N0:1 and SEQ ID NO: 10. 

7 . An isolated protein, which is a product of 
expression of part or all of the isolated nucleic acid 
molecule of claim 1. 

30 

8. Antibodies immunologically specific for the 
protein of claim 7. 

9. A expression cassette, which comprises a 
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plPAC gene coding sequence operably linked to a promoter. 



10. The expression cassette of claim 9, which 
comprises a plPAC gene from Arabidopsis thaliana. 

5 

11. The expression cassette' of claim 10, in 
which the promoter is the cauliflower mosaic virus 35S 
promoter. 

12. The expression cassette of claim 10, in 
plPAC gene is part or all of SEQ ID N0:1 or SEQ 

13. A vector comprising the expression 
15 cassette of claim 9. 

14. The vector of claim 13, which is comprised 
of an Agrobacterium binary vector selected from the group 
consisting of pPZP211 and pCGN7366. 

15. A method for producing a plant with 
enhanced resistance to xenobiotic compounds by 
transforming in vitro the plant with the expression 
cassette of claim 9. 

16. The method of claim 15, wherein the 
transformation step further uses the vector of claim 13. 

17. A transgenic plant produced by the method 
30 of claim 15. 

18. A reproductive unit form the transgenic 
plant of claim 17. 



10 

which the 
ID NO: 10. 



20 



25 
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19.. A cell from the transgenic plant of claim 

17. 

20. A recombinant DNA molecule comprising the 
5 nucleic acid molecule of claim 1, operably linked to a 

vector for transforming cells. 

21. A cell transformed with the recombinant' 
DNA molecule of claim 20. 

10 

22. The cell of claim 21, selected from the 
group consisting of bacterial cells, yeast cells and 
plant cells. 

15 23. A transgenic plant regenerated from the 

transformed cell of claim 22. 

24. An isolated nucleic acid molecule of at 
least 20 nucleotides in length having a sequence selected 
20 from the group consisting of: 

a) SEQ ID N0:1 and SEQ ID NO: 10; 

b) a nucleic acid sequence that is at least 
about 60% homologous to the coding regions of SEQ ID NO:l 
or SEQ ID NO: 10; 

25 c) a sequence hybridizing with SEQ ID NO:l or 

SEQ ID NO: 10 at moderate stringency; 

d) a sequence encoding part or all of a 
polypeptide having SEQ ID NO: 2; 

30 that is 

that is 

that is 



e) a sequence encoding an amino acid sequence 
at least about 70% identical to SEQ ID NO: 2; 

f) a sequence encoding an amino acid sequence 
at least about 80% similar to SEQ ID NO: 2; 

g) a sequence encoding an amino acid sequence 
at least about 40% similar to residues 1-76, 613- 
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669 or 1144-1161 of SEQ ID NO: 2; and 

h) a sequence hybridizing at moderate 
stringency to a sequence encoding residues 1-76, 613-669 
or 1144-1161 of SEQ ID N0:2. 

5 

25. A polypeptide produced by expression of the 
nucleic acid sequence of claim 24. 

.26. Antibodies immunologically specific for the 
10 polypeptide of claim 24. 

27. An oligonucleotide between about 10 and 
about 100 nucleotides in length, which specifically 
hybridizes at moderate stringency with a portion of the 

15 nucleic acid molecule of claim 24. 

28. A recombinant DNA molecule comprising the 
nucleic acid molecule of claim 24, operably linked to a 
vector for transforming cells. 

20 

29. A cell transformed with the recombinant DNA 
molecule of claim 28. 

30. The cell of claim 29, selected from the 

25 group consisting of bacterial cells, yeast cells and plant 
cells. 

31. A transgenic plant regenerated from the 
cell of claim 30. 



30 



32. An isolated plant p-glycoprotein, which is 
inducible upon exposure of the plant to NPPB. 

33. The p-glycoprotein of claim 32, which 
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confers upon a cell in which it is . found resistance to 
Rhodamine 6G. 

34. The p-glycoprotein of claim 33, which is 
preferentially produced in roots upon the exposure to the 
NPPB. 



35. The p-glycoprotein of claim 34, from a 
plant selected from the group consisting of Brassica napus 

10 and Arabidopsis thaliana. 

36. The p-glycoprotein of claim 35, having an 
amino acid sequence that selected from the group 
consisting of: 

15 a) an amino acid sequence that is at least 80% 

similar to SEQ ID NO: 2; 

b) an amino acid sequence that is at least 70% 
identical to SEQ ID NO: 2; 

c) an amino acid sequence that is at least 40% 
20 similar to residues 1-76, 613-669 or 1144-1161 of SEQ ID 

NO:2; and 

d) an amino acid sequence encoded by a nucleic 
acid sequence hybridizing at moderate stringency to a 
amino acid sequence encoding residues 1-76, 613-669 or 

25 1144-1161 of SEQ ID N0:2. 

37. Antibodies immunologically specific for the 
p-glycoprotein of claim 32. 

30 38 The antibodies of claim 35, that are 

immunologically specific to residues 1-76, 613-669 or 
1144-1161 of SEQ ID NO:2. 

39. A plant p-glycoprotein gene promoter which 
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is inducible by NPPB. 

40. The plant p-glycoprotein gene promoter of 
claim 39, that is part or all of residues 1-3429 of SEQ ID 

5 N0:10. 

41. A plant with reduced levels of plPAC 

. protein. 

10 42. The plant of claim 41, wherein the native 

plPAC gene is mutated. 

. 43. The plant of claim 42, wherein, the plPAC 
gene is mutated due to the insertion of a T-DNA. 

15 

44. A method for making the plant of claim 42, 
wherein a population of mutated plants are screened using 
at least one of SEQ ID NOS: 11-14 as PCR primers. 

20 45. The method of claim 44, wherein the 

population of plants is mutated by T-DNA insertion. 
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SEQUENCE LISTING 

<110> Wisconsin Alumni Research Foundation 
Spalding, Edgar P. 
Noh, Bosl 

<120> Xenobiotic Detoxification Gene from 
Plants* 

<130> WARF S212 

<150> 60/101,814 
<151> 1998-09-25 

<160> 14 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 4051 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_f eature 

<222> (94) • . . (0) 

<223> Translation start codon 

<221> misc_feature 
<222> (3932) ... (0) 
<223> Stop codon 

<400> 1 

cttgaacttc acaaaacaat tgtcagattt tcaagaaaaa ctttataaaa caaaaaacat 60 

ttcattcttt ctctctctct ctctcactgc tcaatgatct cgttttctca ctaaaccaac 120 

tcgtttcttc ttactttctt taactcggat ctacaaaaaa ccatgtcgga aactaacaca 180 

accgatgcca agactgttcc agcagaagca gagaagaaga aagaacagag tttaccattc 240 

tttaaactct tttctttcgc tgataaattt gattatctct taatgttcgt tggttctctt 300 

ggtgccattg ttcatggctc ttccatgcct gtcttctttt tactctttgg tcaaatggtt 360 

aatggatttg gtaaaaacca aatggattta catcaaatgg ttcatgaagt ctctagatat 420 

tctctatatt tcgtctactt gggtttggtc gtttgcttct cttcttacgc agagatagca 480 

tgttggatgt attctggaga aagacaagta gcagcattaa ggaagaaata tcttgaagca 540 

gtattaaaac aagacgttgg gttctttgat actgatgcta gaactggtga cattgtcttt 600 

agtgtttcta ctgatactct tcttgttcaa gatgccatta gtgaaaaggt tggaaacttt 660 

atacattacc tctcaacatt tttggcggga ttagtagttg gatttgtatc agcatggaaa 720 

ttagctttgt taagtgttgc tgtgattccc ggaatcgctt tcgccggagg tttatacgct 780 

tatacactca ccggaattac ttcaaagagc cgtgaatctt atgctaacgc cggtgttatc 840 

gccgagcagg caattgctca agttcgaact gtttattctt atgttggaga gagtaaggca 900 

cttaatgcgt attcggatgc gattcagtat acgcttaagc tcggttataa agcggggatg 960 

gctaaagggt tgggtttagg atgtacttat ggaatagctt gtatgtcatg ggctttggtg 1020 

ttttggtatg ctggagtttt tattcggaat ggacaaaccg atggaggaaa ggcgtttact 1080 

gctatattct ctgctattgt tggtggaatg agtttggggc aatctttctc gaatcttggg 1140 

gcgtttagta aaggtaaagc ggctggttat aagttgatgg agataattaa ccagagaccg 1200 

acgataattc aagacccgtt ggatggaaaa tgtttggatc aagttcatgg gaacattgag 1260 

tttaaagatg tgacttttag ctatccttca cggcctgatg ttatgatctt- caggaacttt 1320 

aatattttct tcccttctgg gaaaactgtg gcggttgttg gtgggagtgg ctctggaaag 1380 

agtactgttg tttccctcat tgagagattc tatgatccaa acagcgggca aattctgttg 1440 

gatggtgttg agataaagac gcttcagttg aagtttttgc gtgaacaaat cgggcttgtg 1500 

aatcaagaac ctgcgctctt tgccactact atactagaga acatactcta tggaaagcct 1560 

gatgcaacaa tggttgaagt tgaagctgct gcttccgctg cgaatgcgca tagtttcatt 1620 

acattacttc ctaaaggcta cgacacacag gttggagaac gtggtgttca actctcaggt 1680 
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2. — 

ggacagaagc agagaattgc aattgctagg gcgatgttga aagacccaaa gattctgtta 1740 

ctagatgaag ctacaagcgc tcttgatgct agctctgaga gcattgttca ggaagcttta 1800 

gacagagtca tggtggggag gaccactgtt gttgttgctc atcgtctctg caccatcaga 1860 

aatgttgatt ccattgccgt gatacagcaa ggccaagttg ttgaaaccgg aacacatgaa 1920 

gaactcattg ccaaatccgg tgcttacgca tcccteatca ggtttcagga aatggttggt 1980 

actcgagatt tctcaaaccc gtcaactcgt cgcactcgtt caacccgttt gagccattca 2040 

ctgtcaacga aatcactcag tttaagatca ggaagtttga ggaatctgag ctattcttac 2100 

agcactggag ctgatggtcg gatagagatg atttcaaatg cagagactga ccgaaagact 2160 

cgtgcccctg aaaattactt ctacaggctt ctcaagctta attcaccgga atggccttac 2220 

tcaatcatgg gagcagtagg ctcaattctt tctggtttca ttggtcctac atttgctatt 2280 

gtgatgagca acatgatcga agtcttctac tacacagact atgattcaat ggaaaggaaa 2340 

acaaaagagt atgtcttcat ctacattggt gctggtctct atgctgtggg tgcttatttg 2400 - 

atccaacatt acttctttag catcatggga gaaaacctca caacaagagt aagaagaatg 2460 

atgctctcag ctatcttgag aaacgaagtt ggttggttcg atgaggatga acacaactca 2520 

agcctgatcg ctgcacgttt agctactgat gcagcagatg ttaaatccgc tatagccgag 2580 

agaatctcag taattctaca aaacatgact tcacttctca catccttcat agtcgccttc 2640 

atagtagaat ggagagtctc acttctcatc ttaggcacat tcccacttct agtcctcgct 2700 

aactttgctc agcaactatc tctgaagggt tttgctggag acacagctaa ggctcatgca 2760 

aagacttcaa tgattgctgg tgaaggagtc agtaacatta gaaccgtagc agctttcaat 2820 

gcacagagca agattctctc tttgttctgt catgagcttc gtgtacctca gaaaagaagc 2880 

ttaagcttat accgaagtca aacctcgggt ttcctatttg gcctctcgca gcttgctctc 2940 

tatggttctg aggctttaat tctctggtat ggtgcccacc ttgtgagtaa aggcgtgtca 3000 

accttttcca aagtgatcaa agtgtttgtg gttttggtca ttactgcaaa ctctgttgct 3060 

gaaactgtca gtcttgctcc tgaaattatt cggggaggtg aagctgttgg ttcggttttc 3120 

tcggtcttgg acaggcagac caggattgac ccggatgatg ctgatgctga tcccgtggag 3180 

acgatccgtg gagacattga gtttaggcat gttgatttcg cttacccttc aagacccgac 3240 

gtcatggttt tcagggactt taacctcaga attcgagctg gacatagcca agctcttgtg 3300 

ggcgcgagtg ggtcagggaa gagttctgta attgcgatga tcgagcggtt ttacgacctt 3360 

cttgctggaa aagtcatgat tgatggcaaa gacatccgcc ggctaaacct gaaatctcta 3420 

aggctcaaaa tcggtcttgt tcaacaagaa ccagctcttt tcgcagcaac gatcttcgac 3480 

aacatcgcct atggtaaaga tggtgcaact gaatccgagg taattgatgc agctcgagcc 3540 

gcaaatgctc acggtttcat cagtggttta cctgaaggtt acaaaactcc agtaggcgaa 3600 

agaggagtgc agttatcagg tggacagaaa cagaggatcg cgatagcaag agctgtgctc 3660 

aagaacccta cagtgttgct tctagacgaa gcaactagcg cactagatgc agaatcagaa 3720 

tgcgtgctgc aagaggcgtt agagaggctc atgagaggtc ggaccaccgt ggtagttgct 3780 

caccgcttgt ccaccataag aggtgttgat tgcattggtg tgattcaaga cgggcggatt 3840 

gtggagcaag gcagccattc agagctcgtt agccgaccag agggagctta ttcaaggctg 3900 

ttacagcttc aaacacatag gatttgaagc ttgatcatgg attaaaaaca aaaaatcggt 3960 

ttgtgtaatt ttttttatat taaaacttta atttggaaga tttctatgga ctataacgat 4020 

aatatgaata ggtgtagata atgaagcttt t 4051 

<210> 2 
<211> 1254 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 2 

Met Ser Glu Thr Asn Thr Thr Asp Ala Lys Thr Val Pro Ala Glu Ala 

15 10 15 

Glu Lys Lys Lys Glu Gin Ser Leu Pro Phe Phe Lys Leu Phe Ser Phe 

20 25 30 

Ala Asp Lys Phe Asp Tyr Leu Leu Met Phe Val Gly Ser Leu Gly Ala 

35 40 45 

He Val His Gly Ser Ser Met Pro Val Phe Phe Leu Leu Phe Gly Gin 



His Glu Val Ser Arg Tyr Ser Leu Tyr Phe Val Tyr Leu Gly Leu Val 

85 90 95 

Val Cys Phe Ser Ser Tyr Ala Glu He Ala. Cys Trp Met Tyr Ser Gly 

100 105 110 

Glu Arg Gin Val Ala Ala Leu Arg Lys Lys Tyr Leu Glu Ala Val Leu 
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115 




120 


Lys 


Gin 


Asp 


Val 


Gly Phe Phe Asp 




130 






135 


Val 


Phe 


Ser 


Val 


Ser Thr Asp Thr 


145 








150 


Glu 


Lys 


Val 


Gly 


Asn Phe He His 










165 


Leu 


Val 


Val 


Gly 


Phe Val Ser Ala 








180 




Ala 


Val 


He 


Pro 


Gly He Ala Phe 






195 




200 


Leu 


Thr 


Gly 


He 


Thr Ser Lys Ser 




210 




215 


Val 


He 


Ala 


Glu 


Gin Ala He Ala 


225 








230 


Val 


Glv 


Glu 


Ser 


Lys Ala Leu Asn 










245 


Thr 


Leu 


Lys 


Leu 


Gly Tyr Lys Ala 








260 




Glv 


Cvs 


Thr 


Tvr 


Gly He Ala Cys 






275 




280 


Tvr 


Ala 


Glv 


Val 


Phe He Arg Asn 




290 






295 


Phe 


Thr 


Ala 


He 


Phe Ser Ala He 


305 








310 


Ser 


Phe 


Ser 


Asn 


Leu Gly Ala Phe 










325 


Lys 


Leu 


Met 


Glu 


He lie Asn Gin 








340 




Leu 


Asp 


Gly 


Lys 


Cys Leu Asp Gin 






355 




360 


Asp 


Val 


Thr 


Phe 


Ser Tyr Pro Ser 




370 






375 


Asn 


Phe 


Asn 


He 


Phe Phe Pro Ser 


385 








390 


Gly 


Ser 


Gly 


Ser 


Gly Lys Ser Thr 










405 


Tyr 


Asp 


Pro 


Asn 


Ser Gly Gin He 








420 




Thr 


Leu 


Gin 


Leu 


Lys Phe Leu Arg 






435 




440 


Glu 


Pro 


Ala 


Leu 


Phe Ala Thr Thr 




450 






455 


Lys 


Pro 


Asp 


Ala 


Thr Met Val Glu 


465 








470 


Asn 


Ala 


His 


Ser 


Phe He Thr Leu 










485 


Val 


Gly 


Glu 


Arg 


Gly Val Gin Leu 








500 




Ala 


He 


Ala 


Arg 


Ala Met Leu Lys 






515 




520 


Glu 


Ala 


Thr 


Ser 


Ala Leu Asp Ala 




530 






535 


Ala 


Leu 


Asp 


Arg 


Val Met Val Gly 


545 








550 


Arg 


Leu 


Cys 


Thr 


He Arg Asn Val 










565 


Gly 


Gin 


Val 


Val 


Glu Thr Gly Thr 








580 




Gly 


Ala 


Tyr 


Ala 


Ser Leu He Arg 






595 




600 


Asp 


Phe 


Ser 


Asn 


Pro Ser Thr Arg 



3 



125 



Thr 


Asp 


Ala 


Arg Thr 


Gly 


Asp 


He 








140 








Leu 


Leu 


Val 


Gin Asp 


Ala 


He 


Ser 






155 








160 


Tyr 


Leu 


Ser 


Thr Phe 


Leu 


Ala 


Gly 




170 








175 




Trp 


Lys 


Leu 


Ala Leu 


Leu 


Ser 


Val 


185 








190 






Ala 


Gly 


Gly 


Leu Tyr 


Ala 


Tyr 


Thr 








205 








Arg 


Glu 


Ser 


Tyr Ala 


Asn 


Ala 


Gly 








220 








Gin 


Val 


Arq 


Thr Val 


Tvr 


Ser 


Tvr 






235 








240 


Ala 


Tvr 


Ser 


Asp Ala 


He 


Gin 


Tvr 




250 








255 




Gly 


Met 


Ala 


Lvs Glv 


Leu 


Glv 


Leu 


265 








270 






Met 


Ser 


Trp 


Ala Leu 


Val 


Phe 


Trp 








285 








Glv 


Gin 


Thr 


Asd Glv 


Glv 


Lvs 


Ala 








300 








Val 


Glv 


Glv 


Met Ser 


Leu 


Glv 


Gin 






315 








320 


Ser 


Lvs 


Glv 


Lys Ala 


Ala 


Glv 


Tvr 




330 








335 




Ara 


Pro 


Thr 


He He 


Gin 


Asr> 


Pro 


345 








350 






Val 


His 


Gly 


Asn He 


Glu 


Phe 


Lys 








365 








Arg 


Pro 


Asp 


Val Met 


He 


Phe 


Arg 








380 








Gly 


Lys 


Thr 


Val Ala 


Val 


Val 


Gly 






395 








400 


Val 


Val 


Ser 


Leu He 


Glu 


Arg 


Phe 




410 








415 




Leu 


Leu 


Asp 


Gly Val 


Glu 


He 


Lys 


425 








430 






Glu 


Gin 


He 


Gly Leu 


Val 


Asn 


Gin 








445 








He 


Leu 


Glu 


Asn He 


Leu 


Tyr 


Gly 








460 








Val 


Glu 


Ala 


Ala Ala 


Ser 


Ala 


Ala 






475 








480 


Leu 


Pro 


Lys 


Gly Tyr 


Asp 


Thr 


Gin 




490 








495 




Ser 


Gly 


Gly 


Gin Lys 


Gin 


Arg 


lie 


505 








510 






Asp 


Pro 


Lys 


He Leu 


Leu 


Leu 


Asp. 








525 








Ser 


Ser 


Glu 


Ser He 


Val 


Gin 


Glu 








540 








Arg 


Thr 


Thr 


Val Val 


Val 


Ala 


His 






555 








560 


Asp 


Ser 


He 


Ala Val 


He 


Gin 


Gin 




570 








575 




His 


Glu 


Glu 


Leu He 


Ala 


Lys 


Ser 


585 








590 






Phe 


Gin 


Glu 


Met Val 


Gly 


Thr 


Arg 








605 








Arg 


Thr 


Arg 


Ser Thr 


Arg 


Leu 


Ser 
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610 




615 








620 










His Ser 


Leu Ser 


Thr Lys Ser 


Leu 


Ser Leu 


Arg 


Ser 


Gly 


Ser 


Leu 


Arg 


625 




630 






635 










640 


Asn Leu 


Ser Tyr 


Ser Tyr Ser 


Thr 


Gly Ala 


Asp 


Gly 


Arg 


He 


Glu 


Met 






645 




650 










655 




lie Ser 


Asn Ala 


Glu Thr Asp 


Arg 


Lys Thr 


Arg 


Ala 


Pro 


Glu 


Asn 


Tyr 




660 






665 








670 






Phe Tyr 


Arg Leu. 


Leu Lys Leu 


Asn 


Ser Pro 


Glu 


Trp 


Pro 


Tyr 


Ser 


lie 


675 




680 








685 








Met Gly 


Ala Val; 


Gly Ser He 


Leu 


Ser Gly 


Phe 


He 


Gly 


Pro 


Thr 


Phe 


690 




695 








700 










Ala He 


Val Met 


Ser Asn Met 


He 


Glu Val 


Phe 


Tyr 


Tyr 


Thr 


Asp 


Tyr 


705 




710 






715 










720 


Asp Ser 


Met Glu 


Arg Lys Thr 


Lys 


Glu Tyr 


Val 


Phe 


lie 


Tyr 


lie 


Gly 






725 




730 










735 




Ala Gly 


Leu Tyr 


Ala Val Gly 


Ala 


Tyr Leu 


He 


Gin 


His 


Tyr 


Phe 


Phe 




740 






745 








750 






Ser He 


Met Gly 


Glu Asn Leu 


Thr 


Thr Arg 


Val 


Arg 


Arg 


Met 


Met 


Leu 




755 




760 






765 








Ser Ala 


He Leu 


Arg Asn Glu 


Val 


Gly Trp 


Phe 


Asp 

* 


Glu 


Asp 


Glu 


His 


770 




775 








780 










Asn Ser 


Ser Leu 


He Ala Ala 


Arq 


Leu Ala 


Thr 


Asp 


Ala 


Ala 


Asp 


Val 


785 




790 






795 










800 


Lys Ser 


Ala He 


Ala Glu Arg 


He 


Ser Val 


He 


Leu 


Gin 


Asn 


Met 


Thr 




805 




810 










815 




Ser Leu 


Leu Thr 


Ser Phe He 


Val 


Ala Phe 


He 


Val 


Glu 


Trp 


Arq 


Val 




820 






825 








830 






Ser Leu 


Leu He 


Leu Gly Thr 


Phe 


Pro Leu 


Leu 


Val 


Leu 


Ala 


Asn 


Phe 




835 


840 








845 








Ala Gin 


Gin Leu 


Ser Leu Lys 


Gly 


Phe Ala 


Gly 


Asp 


Thr 


Ala 


Lys 


Ala 


850 




855 








860 










His Ala 


Lys Thr 


Ser Met He 


Ala 


Gly Glu 


Gly 


Val 


Ser 


Asn 


He 


Arg 


865 




870 




875 










880 


Thr Val 


Ala Ala 


Phe Asn Ala 


Gin 


Ser Lys 


He 


Leu 


Ser 


Leu 


Phe 


Cys 






885 




890 










895 




His Glu 


Leu Arg 


Val Pro Gin 


Lys 


Arg Ser 


Leu 


Ser 


Leu 


Tyr 


Arg 


Ser 




900 






905 








910 






Gin Thr 


Ser Gly 


Phe Leu Phe 


Gly 


Leu Ser 


Gin 


Leu 


Ala 


Leu 


Tyr 


Gly 




915 




920 








925 








Ser Glu 


Ala Leu 


lie Leu Trp 


Tyr 


Gly Ala 


His 


Leu 


Val 


Ser 


Lys 


Gly 


930 




935 








940 










Val Ser 


Thr Phe 


Ser Lys Val 


He 


Lys Val 


Phe 


Val 


Val 


Leu 


Val 


He 


945 




950 




955 










960 


Thr Ala 


Asn Ser 


Val Ala Glu 


Thr 


Val Ser 


Leu 


Ala 


Pro 


Glu 


He 


He 






965 




970 










975 




Arg Gly 


Gly Glu 


Ala Val Gly 


Ser 


Val Phe 


Ser 


Val 


Leu 


Asp 


Arg 


Gin 




980 






985 








990 






Thr Arg 


He Asp 


Pro Asp Asp 


Ala 


Asp Ala 


Asp 


Pro 


Val 


Glu 


Thr 


He 




995 




1000 






1005 






Arg Gly 


Asp He 


Glu Phe Arg 


His Val Asp 


Phe 


Ala 


Tyr 


Pro 


Ser 


Arg 


1010 


1015 






1020 








Pro Asp Val Met 


Val Phe Arg Asp Phe Asn 


Leu 


Arg He Arg 


Ala 


Gly 


1025 




1030 






1035 








1040 


His Ser 


Gin Ala 


Leu Val Gly Ala Ser Gly 


Ser Gly Lys 


Ser 


Ser 


Val 



1045 1050 1055 



He Ala Met He Glu Arg Phe Tyr Asp Leu Leu Ala Gly Lys Val Met 

1060 1065 1070 

He Asp Gly Lys Asp lie Arg Arg Leu Asn Leu Lys Ser Leu Arg Leu 

1075 : 1080 1085 

Lys He Gly Leu Val Gin Gin Glu Pro Ala Leu Phe Ala Ala Thr He 

1090 1095 1100 

Phe Asp Asn He Ala Tyr Gly Lys Asp Gly Ala Thr Glu Ser Glu Val 
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1105 1110 1115 1120 

He Asp Ala Ala Arg Ala Ala Asn Ala His Gly Phe He Ser Gly Leu 

1125 1130 1135 

Pro Glu Gly Tyr Lys Thr Pro Val Gly Glu Arg Gly Val Gin Leu Ser 

1140 1145 1150 

Gly Gly Gin Lys Gin Arg He Ala He Ala Arg Ala Val Leu Lys Asn 

1155 1160 1165 

Pro Thr Val Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu 

1170 1175 1180 

Ser Glu Cys Val Leu Gin Glu Ala Leu Glu Arg Leu Met Arg Gly Arg 
1185 1190 1195 1200 

Thr Thr Val Val Val Ala His Arg Leu Ser Thr He Arg Gly Val Asp 

1205 1210 1215 

Cys He Gly Val He Gin Asp Gly Arg He Val Glu Gin Gly Ser His 

1220 1225 1230 

Ser Glu Leu Val Ser Arg Pro Glu Gly Ala Tyr Ser Arg Leu Leu Gin 

1235 1240 1245 

Leu Gin Thr His Arg He 
1250 

<210> 3 

<211> 1280 

<212> PRT 

<213> Homo sapiens 



<300> 

<308> Genbank P08183 
<309> 1997-11-01 

<400> 3 



Met 


Asp 


Leu 


Glu Gly 


Asp 


Arg 


Asn Gly Gly 


Ala Lys Lys 


Lys Asn Phe 


1 






5 






10 




15 


Phe 


Lys 


Leu 


Asn Asn 


Lys 


Ser 


Glu Lys Asp 


Lys Lys Glu 


Lys Lys Pro 








20 






25 




30 


Thr 


Val 


Ser 


Val Phe 


Ser 


Met 


Phe Arg Tyr 


Ser Asn Trp 


Leu Asp Lys 






35 








40 


45 


Leu 


Tyr 


Met 


Val Val 


Gly 


Thr 


Leu Ala Ala 


He. He His 


Gly Ala Gly 




50 








55 




60 


Leu 


Pro 


Leu 


Met Met 


Leu 


Val 


Phe Gly Glu 


Met Thr Asp 


He Phe Ala 


65 








70 




75 


80 


Asn 


Ala 


Gly 


Ash Leu 


Glu 


Asp 


Leu Met Ser 


Asn .lie Thr 


Asn Arg Ser 








85 






90 




95 


Asp 


He 


Asn 


Asp Thr 


Gly 


Phe 


Phe Met Asn 


Leu Glu Glu 


Asp Met Thr 








100 






105 




110 


Arg 


Tyr 


Ala 


Tyr Tyr 


Tyr 


Ser 


Gly lie Gly 


Ala Gly Val 


Leu Val Ala 






115 








120 


125 




Ala 


Tyr 


He 


Gin Val 


Ser 


Phe 


Trp Cys Leu 


Ala Ala Gly 


Arg Gin lie 




130 








135 




140 


His 


Lys 


He 


Arg Lys 


Gin 


Phe 


Phe His Ala 


lie Met Arg 


Gin Glu lie 


145 








150 






155 


160 


Gly 


Trp 


Phe 


Asp Val 


His 


Asp 


Val Gly Glu 


Leu Asn Thr 


Arg Leu Thr 








165 






170 




175 


Asp 


Asp 


Val 


Ser Lys 


He 


Asn 


Glu Val He 


Gly Asp Lys 


He Gly Met 








180 






185 


190 


Phe 


Phe 


Gin 


Ser Met 


Ala 


Thr 


Phe Phe Thr 


Gly Phe He 


Val Gly Phe 






195 








200 


205 


Thr 


Arg 


Gly 


Trp Lys 


Leu 


Thr 


Leu Val He 


Leu Ala He 


Ser Pro Val 




210 








215 




220 




Leu 


Gly 


Leu 


Ser Ala 


Ala 


Val 


Trp Ala Lys 


He Leu Ser 


Ser Phe Thr 


225 








230 




235 


240 


Asp 


Lys 


Glu 


Leu Leu 


Ala 


Tyr 


Ala Lys Ala 


Gly Ala Val 


Ala Glu Glu 








245 






250 


255 
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Val Leu Ala Ala 
260 

Glu Leu Glu Arg 
275 

lie Lys Lys Ala 
290 

Leu lie Tyr Ala 
305 

Val Leu Ser Gly 

Ser Val Leu lie 
340 

Glu Ala Phe Ala 
355 

lie Asp Asn Lys 
370 

Pro Asp Asn lie 
385 

Tyr Pro Ser Arg 

Val Gin Ser Gly 
420 

Lys Ser Thr Thr 
435 

Gly Met Val Ser 
450 

Phe Leu Arg Glu 
465 

Ala Thr Thr He 

Met Asp Glu He 
500 

He Met Lys Leu 
515 

Ala Gin Leu Ser 
530 

Leu Val Arg Asn 
545 

Leu Asp Thr Glu 

Arg Lys Gly Arg 
580 

Arg Asn Ala Asp 
595 

Lys Gly Asn His 
610 

Leu Val Thr Met 
625 

Ala Asp Glu Ser 

Asp Ser Arg Ser 
660 

Arg Gly Ser Gin 
675 

Asp Glu Ser He 
690 

Leu Thr Glu Trp 
705 

Asn Gly Gly Leu 

Gly Val Phe Thr 
740 



He Arg Thr Val He Ala 
265 

Tyr Asn Lys Asn Leu Glu 

280 .. 

He Thr Ala Asn He Ser 

295 . 

Ser Tyr Ala Leu Ala Phe 
310 

Glu Tyr Ser He Gly Gin 
325 330 
Gly Ala Phe Ser Val Gly 
345 

Asn Ala Arg Gly Ala Ala 
360 

Pro Ser He Asp Ser Tyr 
375 

Lys Gly Asn Leu Glu Phe 
390 

Lys Glu Val Lys He Leu 
405 410 
Gin Thr Val Ala Leu Val 
425 

Val Gin Leu Met Gin Arg 
440 

Val Asp Gly Gin Asp He 
455 

He He Gly Val Val Ser 
470 

Ala Glu Asn He Arg Tyr 
485 490 
Glu Lys Ala Val Lys Glu 
505 

Pro His Lys Phe Asp Thr 
520 

Gly Gly Gin Lys Gin Arg 

* 535 

Pro Lys He Leu Leu Leu 
550 

Ser Glu Ala Val Val Gin 
565 570 
Thr Thr He Val He Ala 
585 

Val lie Ala Gly Phe Asp 
600 

Asp Glu Leu Met Lys Glu 
615 

Gin Thr Ala Gly Asn Glu 
630 

Lys Ser Glu lie Asp Ala 
645 * 650 

Ser Leu He Arg Lys Arg 
665 

Ala Gin Asp Arg Lys Leu 
680 

Pro Pro Val Ser Phe Trp 
695 

Pro Tyr Phe Val Val Gly 
710 

Gin Pro Ala Phe Ala He 
725 730 
Arg He Asp Asp Pro Glu 
745 



Phe Gly Gly Gin Lys Lys 
270 

Glu Ala Lys Arg He Gly 
285 

lie Gly Ala Ala. Phe Leu 
300 

Trp Tyr Gly Thr Thr Leu 
315 320 
Val Leu Thr Val Phe Phe 
335 

Gin Ala Ser Pro Ser He 
350 

Tyr Glu He Phe Lys He 
365 

Ser Lys Ser Gly His Lys 
380 

Arg Asn Val His Phe Ser 
395 400 
Lys Gly Leu Asn Leu Lys 
415 

Gly Asn Ser Gly Cys Gly 
430 

Leu Tyr Asp Pro Thr Glu 
445 

Arg Thr He Asn Val Arg 
460 

Gin Glu Pro Val Leu Phe 
475 480 
Gly Arg Glu Asn Val Thr 
495 

Ala Asn Ala Tyr Asp Phe 
510 

Leu Val Gly Glu Arg Gly 
525 

He Ala He Ala Arg Ala 
540 

Asp Glu Ala Thr Ser Ala 
555 560 
Val Ala Leu Asp Lys Ala 
575 

His Arg Leu Ser Thr Val 
590 

Asp Gly Val He Val Glu 
605 

Lys Gly He Tyr Phe Lys 
620 

Val Glu Leu Glu Asn Ala 
635 640 
Leu Glu Met Ser Ser Asn 
655 

Ser Thr Arg Arg Ser Val 
670 

Ser Thr Lys Glu Ala Leu 
685 

Arg lie Met Lys Leu Asn 
700 

Val Phe Cys Ala He He 
715 720 
He Phe Ser Lys He He 
735 

Thr Lys Arg Gin Asn Ser 
750 
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Asn Leu Phe Ser Leu Leu Phe Leu Ala Leu Gly lie lie Ser Phe lie 

755 760 765 

Thr Phe Phe Leu Gin Gly Phe Thr Phe Gly Lys Ala Gly Glu lie Leu 

770 775 780 

Thr Lys Arg Leu Arg Tyr Met Val Phe Arg Ser Met Leu Arg Gin Asp 
785 790 795 800 

Val Ser Trp Phe Asp Asp Pro Lys Asn Thr. Thr Gly Ala Leu Thr Thr 

805 810 815 

Arg Leu Ala Asn Asp Ala Ala Gin Val Lys Gly Ala lie Gly Ser Arg 

820 825 " 830 

Leu Ala Val He Thr Gin Asn He Ala Asn Leu Gly Thr Gly He He 

835 840 845 

He Ser Phe He Tyr Gly Trp Gin Leu Thr Leu Leu Leu Leu Ala He 

850 855 860 

Val Pro He He Ala He Ala Gly Val Val Glu Met Lys Met Leu Ser 
865 870 875 880 

Gly Gin Ala Leu Lys Asp Lys Lys Glu Leu Glu Gly Ala Gly Lys He 

885 890 ~ 895 

Ala Thr Glu Ala He Glu Asn Phe Arg Thr Val Val Ser Leu Thr Gin 

900 905 910 

Glu Gin Lys Phe Glu His Met Tyr Ala Gin Ser Leu Gin Val Pro Tyr 

915 920 925 . 

Arg Asn Ser Leu Arg Lys Ala His He Phe Gly He Thr Phe Ser Phe 

930 935 940 

Thr Gin Ala Met Met Tyr Phe Ser Tyr Ala Gly Cys Phe Arg Phe Gly 
945 950 955 960 

Ala Tyr Leu Val Ala His Lys Leu Met Ser Phe Glu Asp Val Leu Leu 

965 970 975 

Val Phe Ser Ala Val Val Phe Gly Ala Met Ala Val Gly Gin Val Ser 

980 985 990 

Ser Phe Ala Pro Asp Tyr Ala Lys Ala Lys He Ser Ala Ala His He 

995 1000 1005 

He Met He He Glu Lys Thr Pro Leu He Asp Ser Tyr Ser Thr Glu 

1010 1015 1020 

Gly Leu Met Pro Asn Thr Leu Glu Gly Asn Val Thr Phe Gly Glu Val 
1025 1030 1035 1040 

Val Phe Asn Tyr Pro Thr Arg Pro Asp He Pro Val Leu Gin Gly Leu 

1045 1050 1055 

Ser Leu Glu Val Lys Lys Gly Gin Thr Leu Ala Leu Val Gly Ser Ser 

1060 1065 1070. 

Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu Glu Arg Phe Tyr Asp 

1075 1080 1085 

Pro Leu Ala Gly Lys Val Leu Leu Asp Gly Lys Glu He Lys Arg Leu 

1090 1095 ' 1100 

Asn Val Gin Trp Leu Arg Ala His Leu Gly He Val Ser Gin Glu Pro 
1105 1110 1115 1120 

He Leu Phe Asp Cys Ser He Ala Glu Asn lie Ala Tyr Gly Asp Asn 

1125 1130 1135 

Ser Arg Val Val Ser Gin Glu Glu lie Val Arg Ala Ala Lys Glu Ala 

1140 1145 1150 

Asn He His Ala Phe He Glu Ser Leu Pro Asn Lys Tyr Ser Thr Lys 

1155 1160 1165 

Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin Arg He 

1170 1175 1180 

Ala He Ala Arg Ala Leu Val Arg Gin Pro His He Leu Leu Leu Asp 
1185 1190 1195 1200 

Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val Gin Glu 

1205 1210 1215 

Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He Ala His 

1220 1225 1230 

Arg Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val Phe Gin Asn 
1235 1240 1245 
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Gly Arg Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala Gin Lys 

1250 1255 1260 

Gly lie Tyr Phe Ser Met Val Ser Val Gin Ala Gly Thr Lys Arg Gin 
1265 1270 1275 1280 





<210> 


4 


















<211> 


1175 
















<212> 


PRT 


















<213> 


Mus 


mus cuius 












<300> 




















<308> 


Genbank P06795 












<309> 


1998-07- 


-15 














<400> 


4 
















Ser 


Asn Ser 


Ser 


Leu 


Glu 


Glu 


Glu Met 


Ala He 


Tyr Ala Tyr Tyr 


Tyr 


1 






5 








10 


15 




Thr 


Gly He 


Gly 


Ala 


Gly 


Val 


Leu He 


Val Ala 


Tyr He Gin Val 


Ser 






20 








25 




30 




Leu 


Trp Cys 


Leu 


Ala 


Ala 


Gly 


Arg Gin 


lie His 


Lys He Arg Gin 


Lys 




35 










40 




45 




Phe 


Phe His 


Ala 


He 


Met 


Asn 


Gin Glu 


He Gly 


Trp Phe Asp Val 


His 




50 








55 




60 




Asp 


Val Gly 


Glu 


Leu 


Asn 


Thr 


Axg Leu 


Thr Asp 


Asp Val Ser Lys 


He 


65 








70 






75 




80 


Asn 


Asp Gly 


He 


Gly 


Asp 


Lys 


He Gly 


Met Phe 


Phe Gin Ser He 


Thr 








85 








90 


95 




Thr 


Phe Leu 


Ala 


Gly 


Phe 


He 


He Gly 


Phe He 


Ser Gly Trp Lys 


Leu 






100 








105 




110 




Thr 


Leu Val 


He 


Leu 


Ala 


Val 


Ser Pro 


Leu He 


Gly Leu Ser Ser 


Ala 




115 










120 




125 




Leu 


Trp Ala 


Lys 


Val 


Leu 


Thr 


Ser Phe 


Thr Asn 


Lys Glu Leu Gin 


Ala 




130 








I JO 






140 




Tyr 


Ala Lys 


Ala 


Gly 


Ala 


val 


Ala Glu 


Glu Val 


Leu Ala Ala He 


Arg 


145 








IDU 






155 




160 


Thr 


Val He 


Ala 


Phe 


Gly 


Gly 


Gin Gin 


Lys Glu 


Leu Glu Arg Tyr 


Asn 








165 








170 


175 




Lys 


Asn Leu 


Glu 


Glu 


/via 


Lys 


Asn Val 


Gly He 


Lys Lys Ala He 


Thr 






180 








185 




190 




Ala 


Ser He 


Ser 


He 


Gly 


He 


Ala Tyr 


Leu Leu 


Val Tyr Ala Ser 


Tyr 




195 










200 




205 




Ala 


Leu Ala 


Phe 


Trp 


Tyr 


Gly 


Thr Ser 


Leu Val 


Leu Ser Asn Glu 


Tyr 




210 








215 






220 




Ser 


He Gly 


Glu 


Val 


Leu 


Thr 


Val Phe 


Phe Ser 


He Leu Leu Gly 


Thr 


225 








230 






235 




240 


PVl A 

trne 


oer lie 


Gly 


His 


Leu 


Ala 


Pro Asn 


ixe giu 


Ala Phe Ala Asn 


Ala 








245 








250 


255 




Arg 


Gly Ala 


Ala 


Phe 


Glu 


He 


Phe Lys 


He He 


Asp Asn Glu Pro 


Ser 






260 








265 




270 




He 


Asp Ser 


Phe 


Ser 


Thr 


Lys 


Gly Tyr 


Lys Pro 


Asp Ser He Met 


Gly 




275 










280 




285 




Asn 


Leu Glu 


Phe 


Lys 


Asn 


Val 


His Phe 


Asn Tyr 


Pro Ser Arg Ser 


Glu 




290 








295 




300 




Val 


Gin He 


Leu 


Lys 


Gly 


Leu 


Asn Leu 


Lys Val 


Lys Ser Gly Gin 


Thr 


305 








310 






315 




320 


Val 


Ala Leu 


Val 


Gly 


Asn 


Ser 


Gly Cys 


Gly Lys 


Ser Thr Thr Val 


Gin 








325 








330 


335 




Leu 


Met Gin 


Arg 


Leu 


Tyr 


Asp 


Pro Leu 


Glu Gly 


Val Val Ser He 


Asp 






340 








345 




350 




Gly 


Gin Asp 


He 


Arg 


Thr 


He 


Asn Val 


Arg Tyr 


Leu Arg Glu He 


He 




355 










360 




365 




Gly 


Val Val 


Ser 


Gin 


Glu 


Pro 


Val Leu 


Phe Ala 


Thr Thr He Ala 


Glu 
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370 






375 




Asn 


He 


Arg 


Tyr 


Gly Arg Glu 


Asp 


385 








390 




Ala 


Val 


Lys 


Glu 


Ala Asn Ala 


Tyr 










405 


Gin 


Phe 


Asp 


Thr 


Leu Val Gly 


Glu 








420 






Gin 


Lys 


Gin 


Arg 


He Ala He 


Ala 






435 




- 


440 


lie 


Leu 


Leu 


Leu 


Asp Glu Ala 


Thr 




450 






455 




Ala 


Val 


Val 


Gin 


Ala Ala Leu 


Asp 


465 








470 




lie 


Val 


He 


Ala 


His Arg Leu 


Ser 










485 




Ala 


Gly 


Phe 


Asp 


Gly Gly Val 


He 








500 






Leu 


Met 


Arg 


Glu 


Lys Gly He 


Tyr 






515 






520 


Arg 


Gly 


Asn 


Glu 


lie Glu Pro 


Gly 




530 






535 




Asp 


Thr 


Asp 


Ala 


Ser Glu Leu 


Thr 


545 








550 




lie 


Arg 


Arg 


Ser 


He Tyr Arg 


Ser 










565 




Arg 


Arg 


Leu 


Ser 


Met Lys Glu 


Ala 








580 






Ser 


Phe 


Trp 


Arg 


lie Leu Asn 


Leu 






595 






600 


Leu 


Val 


Gly 


Val 


Leu Cys Ala 


Val 




610 






615 




Phe 


Ala 


He 


Val 


Phe Ser Arg 


He 


625 








630 




Asp 


His 


Glu 


Thr 


Lys Arg Gin 


Asn 










645 




Leu 


Val 


Met 


Gly 


Leu He Ser 


Phe 








660 






Thr 


Phe 


Gly 


Lys 


Ala Gly Glu 


He 






675 






680 


Val 


Phe 


Lys 


Ser 


Met Leu Arg 


Gin 




690 






695 




Lys 


Asn 


Ser 


Thr 


Gly Ser Leu 


Thr 


705 








710 




Ser 


Val 


Lys 


Gly 


Ala Met Gly 


Ala 










725 




Val 


Ala 


Asn 


Leu 


Gly Thr Gly 


Val 








740 




Gin 


Leu 


Thr 


Leu 


Leu Leu Val 


Val 






755 






760 


Gly 


He 


He 


Glu 


Met Lys Leu 


Leu 




770 






775 




Lys 


Gin 


Leu 


Glu 


He Ser Gly 


Lys 


785 








790 


Phe 


Arg 


Thr 


He 


Val Ser Leu 


Thr 










805 




Tyr 


Ala 


Gin 


Ser 


Leu Gin Val 


Pro 








820 






His 


Val 


Phe 


Gly 


He Thr Phe 


Ser 






835 






840 


Ser 


Tyr 


Ala 


Ala 


Cys Phe Arg 


Phe 




850 






855 




Leu 


Met 


Thr 


Phe 


Glu Asn Val 


Met 



9 



380 



Val 


Thr 


Met 


Asp 


Glu 


He 


Glu 


Lys 






395 










400 


Asp 


Phe 


He 


Met 


Lys 


Leu 


Pro 


His 




410 










415 




Arg 


Gly 


Ala 


Gin 


Leu 


Ser 


Gly 


Gly 


425 










430 






Arg 


Ala 


Leu 


Val 


Arg 


Asn 


Pro 


Lys 










445 








Ser 


Ala 


Leu 


Asp 


Thr 


Glu 


Ser 


Glu 








460 










Lys 


Ala 


Arg 


Glu 


Gly 


Arg 


Thr 


Thr 






475 










480 


Thr 


Val 


Arg 


Asn 


Ala 


Asp 


Val 


He 




490 










495 




Val 


Glu 


Gin 


Gly 


Asn 


His 


Asp 


Glu 


505 










510 






Phe 


Lys 


Leu 


Val 


Met 


Thr 


Gin 


Thr 










525 








Asn 


Asn 


Ala 


Tyr 


Gly 


Ser 


Gin 


Ser 








540 










Ser 


Glu 


Glu 


Ser 


Lys 


Ser 


Pro 


Leu 






555 










560 


Val 


His 


Arg 


Lys 


Gin 


Asp 


Gin 


Glu 




570 










575 




Val 


Asp 


Glu 


Asp 


Val 


Pro 


Leu 


Val 


585 










590 






Asn 


Leu 


Ser 


Glu 


Trp 


Pro 


Tyr 


Leu 










605 






He 


Asn 


Gly 


Cys 


He 


Gin 


Pro 


Val 








620 










Val 


Gly 


Val 


Phe 


Ser 


Arg 


Asp 


Asp 






635 










640 


Cys 


Asn 


Leu 


Phe 


Ser 


Leu 


Phe 


Phe 




650 










655 




Val 


Thr 


Tyr 


Phe 


Phe 


Gin 


Gly 


Phe 


665 










670 






Leu 


Thr 


Lys 


Arg 


Val 


Arg 


Tyr Met 










685 








Asp 


He 


Ser 


Trp 


Phe 


Asp 


Asp 


His 








700 










Thr 


Arg 


Leu 


Ala 


Ser 


Asp 


Ala 


Ser 






715 










720 


Arg 


Leu 


Ala 


Val 


Val 


Thr 


Gin 


Asn 


730 










735 




He 


Leu 


Ser 


Leu 


Val 


Tyr 


Gly 


Trp 


745 










750 






He 


He 


Pro 


Leu 


He 


Val 


Leu 


Gly 










765 








Ser 


Gly 


Gin 


Ala 


Leu 


Lys 


Asp 


Lys 








780 










He 


Ala 


Thr 


Glu 


Ala 


He 


Glu 


Asn 






795 










800 


Arg 


Glu 


Gin 


Lys 


Phe 


Glu 


Thr 


Met 




810 










815 




Tyr 


Arg 


Asn 


Ala 


Met 


Lys 


Lys 


Ala 


825 










830 






Phe 


Thr 


Gin 


Ala 


Met 


Met 


Tyr 


Phe 










845 






Gly 


Ala 


Tyr 


Leu 


Val 


Ala 


Gin 


Gin 








860 










Leu 


Val 


Phe 


Ser 


Ala 


Val 


Val 


Phe 
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Gly 


Ala 


Met 


Ala Ala 


Gly 


Asn 


Thr 


Ser 


Ser 


Phe 


Ala 


Pro 


Asp 


Tyr 


Ala 










835 










890 










895 




Lys 


Ala 


Lys 


Val 
900 


Ser 


Ala 


Ser 


His 


He 
905 


lie 


Arg 


He 


He 


Glu 
910 


Lys 


Thr 


Pro 


Glu 


lie 


Asp Ser 


Tyr 


Ser 


Thr 


Glu 


Gly 


Leu 


Lys 


Pro 


Thr 


Leu 


Leu 






915 










920 










925 








Glu 


Gly 
930 


Asn 


Val 


Lys 


Phe 


Asn 
935 


Gly 


Val 


Gin 


Phe 


Asn 
940 


Tyr 


Pro 


Thr 


Arg 


Pro 


Asn 


He 


Pro 


Val 


Leu 


Gin 


Gly 


Leu 




Leu 


Glu 


Val 


Lys 


Lys 


t»xy 


945 










950 










955 










960 


Gin 


Thr 


Leu 


Ala 


Leu 
965 


Val 


Gly 


Ser 


Ser 


Gly 
970 


Cys 


Gly 


Lys 


Ser 


Thr 
975 


Val 


Val 


Gin 


Leu 


Leu 
980 


Glu 


Arg 


Phe 


Tyr 


Asp 
985 


Pro 


Met 


Ala 


Gly 


Ser 
990 


Val 


Phe 


Leu 


Asp 


Gly 


Lys 


Glu 


He 


Lys 


Gin 


Leu 


Asn 


Val 


Gin 


Trp 


Leu 


Arg 


Ala 






995 










1000 








1005 






His. 


Leu 


Gly 


He 


Val 


Ser 


Gin 


Glu 


Pro 


He 


Leu 


Phe 


Asp Cys 


Ser 


He 




1010 








1015 








1020 








Ala 


Glu 


Asn 


He 


Ala 


Tyr 


Gly Asp Asn 


Ser 


Arg 


Ala 


Val 


Ser 


His 


Glu 


1025 








1030 








1035 








1040 


Glu 


lie 


Val 


Arg Ala 


Ala Lys Glu Ala 


Asn 


He 


His 


Gin 


Phe 


He 


Asp 



104S^ 1050 1055 

Ser Leu Pro Asp Lys Tyr Asn Thr Arg Val Gly Asp Lys Gly Thr Gin 

1060 1065 1070 

Leu Ser Gly Gly Gin Lys Gin Arg He Ala He Ala Arg Ala Leu Val 

1075 1080 1085 

Arg Gin Pro His lie Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp 

1090 1095 1100 

Thr Glu Ser Glu Lys Val Val Gin Glu Ala Leu Asp Lys Ala Arg Glu 
il05 1110 1115 1120 

Gly Arg Thr Cys He Val He Ala His Arg Leu Ser Thr He Gin Asn 

1125 1130 1135 

Ala Asp Leu He Val Val He Glu Asn Gly Lys Val Lys Glu His Gly 

1140 1145 1150 

Thr His Gin Gin Leu Leu Ala Gin Lys Gly He Tyr Phe Ser Met Val 

1155 1160 ' " 1165 

Gin Ala Gly Ala Lys Arg Ser 
1170 1175 



<210> 5 

<211> 1066 

<212:> PRT 

<213> Homo sapiens 





<300> 










<308> 


Genbank 


P21439 






<309> 


1998-07- 


-15 






<400> 


5 






Trp 


Lys Leu 


Thr Leu 


Val He 


Met 


1 




5 






Ser 


Ala Ala 


Val Trp 


Ala Lys 


lie 






20 






Leu 


Ala Ala 


Tyr Ala 


Lys Ala Gly 




35 






40 


Ala 


He Arg 


Thr Val 


He Ala 


Phe 




50 




55 




Arg 


Tyr Gin 


Lys His 


Leu Glu 


Asn 


65 






70 




Ala 


lie Ser 


Ala Asn 


He Ser 


Met 






85 







Ala He Ser Pro He Leu Gly Leu 

10 15 

Leu Ser Ala Phe Ser Asp Lys Glu 
25 30 

Ala Val Ala Glu Glu Ala Leu Gly 
45 

Gly Gly Gin Asn Lys Glu Leu Glu 
60 

Ala Lys Glu He Gly He Lys Lys 

75 80 

Gly He Ala Phe Leu Leu He Tyr 
90 95 
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Ala Ser 

Lys Glu 

He Gly 
130 
Ala Asn 
145 

Asn Pro 

He Lys 

Arg Ala 

Gly Gin 
210 
Thr Val 
225 

Asn He 

Glu He 

He Ala 

He Lys 
290 
Leu Pro 
305 

Ser Gly 

Asn Pro 

Glu Ser 

Arg Thr 
370 
Asp Val 
385 

His Ser 

Met Gin 

Asp Glu 

Leu Phe 
450 
Gin Lys 
465 

Pro Val 
Tyr Phe 
Pro Ala 



Tyr Ala Leu Ala Phe Trp 
100 

Tyr Thr He Gly 
115 

Ala Phe Ser Val 



Gly Asp 
530 
Phe Leu 
545 

Phe Thr 



Ala Arg Gly Ala 
150 

Lys He Asp Ser 
165 

Gly Asn Leu Glu 
180 

Asn Val Lys He 
195 

Thr Val Ala Leu 

Gin Leu "He Gin 
230 

Asp Gly Gin Asp 
245 

He Gly Val Val 
260 

Glu Asn He Cys 
275 

Lys Ala Val Lys 

Gin Lys Phe Asp 
310 

Gly Gin Lys Gin 
325 

Lys He Leu Leu 
340 

Glu Ala Glu Val 
355 

Thr He Val He 

He Ala Gly Phe 
390 

Glu Leu Met Lys 
405 

Thr Ser Gly Ser 
420 

Lys Ala Ala Thr 
435 

Arg His Ser Thr 

Ser Leu Asp Val 
470 

Ser Phe Leu Lys 
485 

Val Val Gly Thr 
500 

Phe Ser Val He 
515 

Asp Ala Val Lys 



Asn Ala 
120 
Gly Gin 
135 

Ala Tyr 



Phe Leu Gly He 
550 

Phe Gly Lys Ala 
565 

Met Ala Phe Lys Ala Met 
580 



Phe Ser 

Phe Asn 

Leu Lys 
200 
Val Gly 
215 

Arg Leu 

He Arg 

Ser Gin 

Tyr Gly 
280 
Glu Ala 
295 

Thr Leu 

Arg He 

Leu Asp 

Gin Ala 
360 
Ala His 
375 

Glu Asp 

Lys Glu 

Gin He 

Arg Met 
440 
Gin Lys 
455 

Glu Thr 

Val Leu 

Val Cys 

Phe Ser 
520 
Gin Gin 
535 

He Ser 
Gly Glu 
Leu Arg 



Tyr Gly Ser 
105 

Met Thr Val 

Ala Ala Pro 

Val He Phe 
155 

Glu Arg Gly 

170 
Asp Val His 
185 

Gly Leu Asn 

Ser Ser Gly 

Tyr Asp Pro 
235 

Asn Phe Asn 

250 
Glu Pro Val 
265 

Arg Gly Asn 

Asn Ala Tyr 

Val Gly Glu 
315 

Ala He Ala 

330 
Glu Ala Thr 
345 

Ala Leu Asp 

Arg Leu Ser 

Gly Val He 
395 

.Gly Val Tyr 

410 
Gin Ser Glu 
425 

Ala Pro Asn 

Asn Leu Lys 

Asp Gly Leu 
475 

Lys Leu Asn 
490 

Ala He Ala. 
505 

Glu lie lie 

Lys Cys Asn 

Phe Phe Thr 
555 

He Leu Thr 

570 
Gin Asp Met 
585 



Thr Leu 

Phe Phe 
125 
Cys He 
140 

Asp He 

His Lys 

Phe Ser 

Leu Lys 
205 
Cys Gly 
220 

Asp Glu 

Val Asn 

Leu Phe 

Val Thr 
285 
Glu Phe 
300 

Arg Gly 
Arg Ala 
Ser Ala 



Val He Ser 
110 

Ser He Leu 

Asp Ala Phe 

He Asp Asn 
160 

Pro Asp Ser 

175 
Tyr Pro Ser 
190 

Val Gin Ser 

Lys Ser Thr 

Gly Thr He 
240 

Tyr Leu Arg 

255 
Ser Thr Thr 
270 

Met Asp Glu 

lie Met Lys 

Ala Gin Leu 
320 

Leu Val Arg 

335 
Leu Asp Thr 
350 

Arg Glu Gly 



Lys Ala 

365, 

Thr Val Arg Asn Ala 
380 

Val Glu 



Phe Lys 

Glu Phe 

Gly Trp 
445 
Asn Ser 
460 

Glu Ala 

Lys Thr 

Asn Gly 

Ala He 
525 
He Phe 
540 

Phe Phe 
Arg Arg 
Ser Trp 



Gin Gly Ser 
400 

Leu Val Asn 

415 
Glu Leu Asn 
430 

Lys Ser Arg 

Gin Met Cys 

Asn Val Pro 
480 

Glu Trp Pro 

495 
Gly Leu Gin 
510 

Phe Gly Pro 

Ser Leu He 

Leu Gin Gly 
560 

Leu Arg Ser 

575 
Phe Asp Asp 
590 
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His 


Lys 


Asn 


Ser 


Thr 


Gly 


Ala 


Leu 


Ser Thr 


Arg 


Leu 


Ala 


Thr 


Asp 


Ala 






595 










600 








605 








Ala 


Gin 


Val 


Gin 


Gly 


Ala 


Thr 


Gly 


Thr Arg 


Leu 


Ala 


Leu 


lie 


Ala 


Gin 




610 










615 








620 










Asn 


He 


Ala 


Asn 


Leu 


Gly 


Thr 


Gly 


He He 


He 


Ser 


Phe 


He 


Tyr 


Gly 


625 










630 








635 










640 


Trp 


Gin 


Leu *Thr 


Leu 


Leu 


Leu 


Leu 


Ala Val 


Val 


Pro 


He 


He 


Ala 


Val 








645 








650 










o5o 




Ser 


Gly 


lie Val 


Glu 


Met 


Lys 


Leu 


Leu Ala 


Gly 


Asn 


Ala 


Lys 


Arg 


Asp 








660 










665 








670 






Lys 


Lys 


Glu 


Leu 


Glu 


Ala 


Ala 


Gly 


Lys He 


Ala 


Thr 


Glu 


Ala 


He 


Glu 




675 










680 






685 








Asn 


He 


Arg 


Thr 


Val 


Val 


Ser 


Leu 


Thr Gin 


Glu 


Arg 


Lys 


Phe 


Glu 


Ser 




690 










695 








700 










Met 


Tyr 


Val 


Glu 


Lys 


Leu 


Tyr 


Gly 


Pro Tyr 


Arg 


Asn 


Ser 


Val 


Gin 


Lys 


705 










710 








715 










720 


Ala 


His 


He 


Tyr 


Gly 


He 


Thr 


Phe 


Ser He 


Ser 


Gin 


Ala 


Phe 


Met 


Tyr 










725 








730 










735 




Phe 


Ser 


Tyr 


Ala 


Gly 


Cys 


Phe 


Arg 


Phe Gly 


Ala 


Tyr 


Leu 


He 


Val 


Asn 








740 










745 








750 






Gly 


His 


Met 


Arg 


Phe 


Arg 


Asp 


Val 


He Leu 


Val 


Phe 


Ser 


Ala 


He 


Val 






755 










760 








765 








Phe 


Gly 


Ala 


Val 


Ala 


Leu 


Gly 


His 


Ala Ser 


Ser 


Phe 


Ala 


Pro 


Asp 


Tyr 




770 










775 








780 










Ala 


Lys 


Ala 


Lys 


Leu 


Ser 


Ala 


Ala 


His Leu 


Phe 


Met 


Leu 


Phe 


Glu 


Arg 


785 










790 








795 










800 


Gin 


Pro 


Leu 


He 


Asp 


Ser 


Tyr 


Ser 


Glu Glu 


Gly 


Leu 


Lys 


Pro 


Asp 


Lys 










805 








810 










815 




Phe 


Glu 


Gly 


Asn 


He 


Thr 


Phe 


Asn 


Glu Val 


Val 


Phe 


Asn 


Tyr 


Pro 


Thr 






820 










825 








830 






Arg 


Ala 


Asn 


Val 


Pro 


Val 


Leu 


Gin 


Gly Leu 


Ser 


Leu 


Glu 


Val 


Lys 


Lys 






835 










840 








845 








Gly 


Gin 


Thr 


Leu 


Ala 


Leu 


Val 


Gly 


Ser Ser 


Gly 


Cys 


Gly 


Lys 


Ser 


Thr 




850 










855 








860 










Val 


Val 


Gin 


Leu 


Leu 


Glu 


Arg 


Phe 


Tyr Asp 


Pro 


Leu 


Ala 


Gly 


Thr 


Val 


865 










870 






875 










oon 

oou 


Leu 


Leu 


Asp 


Gly 


Gin 


Glu 


Ala 


Lys 


Lys Leu 


Asn 


Val 


Gin 


Trp 


Leu 


Arg 










885 








890 










895 




Ala 


Gin 


Leu 


Gly 


He 


Val 


Ser 


Gin 


Glu Pro 


He 


Leu 


Phe 


Asp 


Cys 


Ser 








900 










905 








910 






lie 


Ala 


Glu 


Asn 


He 


Ala 


Tyr 


Gly 


Asp Asn 


Ser 


Arg 


Val 


Val 


Ser 


Gin 






915 










920 






925 








Asp 


Glu 


He 


Val 


Ser 


Ala 


Ala 


Lys 


Ala Ala 


Asn 


He 


His 


Pro 


Phe 


He 




930 










935 






940 










Glu 


Thr 


Leu 


Pro 


His 


Lys 


Tyr 


Glu 


Thr Arg 


Val 


Gly 


Asp 


Lys 


Gly 


Thr 


945 










950 








955 










960 


Gin 


Leu 


Ser 


Gly 


Gly 


Gin 


Lys 


Gin 


Arg He 


Ala 


He 


Ala 


Arg 


Ala 


Leu 










965 








970 










975 




He 


Arg 


Gin 


Pro 


Gin 


He 


Leu 


Leu 


Leu Asp 


Glu 


Ala 


Thr 


Ser 


Ala 


Leu 








980 










985 








990 






Asp 


Thr 


Glu 


Ser 


Glu 


Lys 


Val 


Val 


Gin Glu 


Ala 


Leu 


Asp 


Lys 


Ala 


Arg 






995 










1000 






1005 






Glu 


Gly 


Arg 


Thr 


Cys 


He 


Val 


He 


Ala His 


Arg 


Leu 


Ser 


Thr 


He 


Gin 




1010 








1015 






1020 








Asn 


Ala Asp 


Leu 


He 


Val 


Val 


Phe 


Gin Asn 


Gly 


Arg Val Lys 


Glu 


His 


1025 








1030 






1035 








1040 


Gly Thr His 


Gin 


Gin 


Leu Leu Ala Gin Lys 


Gly He Tyr Phe 


Ser 


Met 










1045 






1050 








1055 



Val Ser Val Gin Ala Gly Thr Gin Asn Leu 
1060 1065 



<210> 6 
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<211> 1266 

<212> PRT 

<213> Mus mus cuius 

<300> 



<308> 


Genbank 


P21440 




<309> 


1997-11- 


-01 






<400> 


D 










Trp Lys Leu 


Thr 


Leu 


Val 


He 


Met 


1 




5 








Ser Thr Ala 


Val 


Trp 


Ala 


Lys 


He 




20 










Leu Ala Ala 


Tyr 


Ala 


Lys 


Ala 


Gly 


35 










40 


Ala lie Arg 


Thr 


Val 


He 


Ala 


Phe 


50 








55 




Arg Tyr Gin 


Lys 


His 


Leu 


Glu 


Asn 


65 






70 






Ala He Ser 


Ala 


Asn 


He 


Ser 


Met 






o c 
03 








Ala Ser Tyr 


Ala 


Leu 


Ala 


Phe 


Trp 




100 










Lys Glu Tyr 


Thr 


He 


Gly 


Asn 


Ala 


115 










120 


He Gly Ala 


Phe 


Ser 


Val 


Gly 


Gin 


130 








135 




Ala Asn Ala 


Arg 


Gly 


Ala 


Ala 


Tyr 


145 






150 






Asn Pro Lys 


lie 


Asp 


Ser 


Phe 


Ser 






1 DO 








He Lys Gly 


Asn 


Leu 


Glu 


Phe 


Ser 














Arg Ala Asn 


lie 


Lys 


He 


Leu 


Lys 


195 










200 


Gly Gin Thr 


val 


Ala 


Leu 


Val 


Gly 


210 








215 




Thr Val Gin 


Leu 


Leu 


Gin 


Arg 


Leu 


225 






230 






Ser He Asp 






Asp 


He 


Arg 














Glu He He 




Val 


Val 


Ser 


Gin 




260 










He Ala Glu 


Asn 


lie 


Arg 


Tyr 


Gly 


275 










280 


He Glu Lys 


Ala 


Val 


Lys 


Glu 


Ala 


290 








295 




Leu Pro Gin 


Lys 


Phe 


Asp 


Thr 


Leu 


305 






310 






Ser Gly Gly 


Gin 


Lys 


Gin 


Arg 


He 




325 








Asn Pro Lys 


He 


Leu 


Leu 


Leu 


Asp 




340 








Glu Ser Glu 


Ala 


Glu 


Val 


Gin 


Ala 


355 










360 


Arg Thr Thr 


He 


Val 


He 


Ala 


His 


370 








375 




Asp Val He 


Ala 


Gly 


Phe 


Glu 


Asp 


385 






390 






His Ser Glu 


Leu 


Met 


Lys 


Lys 


Glu 






405 






Met Gin Thr 


Ala 


Gly 


Ser 


Gin 


He 



13 



Ala 


lie Ser Pro 


He Leu Gly Leu 




10 


15 


Leu 


Ser Thr Phe 


Ser Asp Lys Glu 


25 




30 


Ala 


Val Ala Glu 


Glu Ala Pro Gly 






45 


Gly 


Gly Gin Asn 


Lys Glu Leu Glu 




60 




Ala 


Lys Lys He 


Gly He Lys Lys 




75 


80 


Gly 


He Ala Phe 


Leu Leu He Tyr 




90 


95 


Tyr 


Gly Ser Thr 


Leu Val He Ser 


105 




110 


Met 


Thr Val Phe 


Phe Ser He Leu 






125 


Ala 


Ala Pro Cys 


He Asp Ala Phe 




140 




Val 


He Phe Asp 


He He Asp Asn 




155 


160 


Glu 


Arg Gly His 


Lys Pro Asp Asn 




170 


175 


Asp 


Val His Phe 


Ser Tyr Pro Ser 


185 




190 


Gly 


Leu Asn Leu 


Lys Val Lys Ser 






205 


Asn 


Ser Gly Cys 


Gly Lys Ser Thr 




220 




Tyr 


Asp Pro Thr 


Glu Gly Lys He 




235 


240 


Asn 


Phe Asn Val 


Arg Cys Leu Arg 




250 


255 


Glu 


Pro Val Leu 


Phe Ser Thr Thr 


265 




270 


Arg 


Gly Asn Val 


Thr Met Asp Glu 






285 


Asn 


Ala Tyr Asp 


Phe He Met Lys 




300 




Val 


Gly Asp Arg 


Gly Ala Gin Leu 




315 


320 


Ala 


lie Ala Arg 


Ala Leu Val Arg 




330 


335 


Glu 


Ala Thr Ser 


Ala Leu Asp Thr 


345 




350 


Ala 


Leu Asp Lys 


Ala Arg Glu Gly 






365 


Arg 


Leu Ser Thr 


He Arg Asn Ala 




380 




Gly 


Val He Val 


Glu Gin Gly Ser 




395 


400 


Gly 


He Tyr Phe 


Arg Leu Val Asn 




410 


415 


Leu 


Ser Glu Glu 


Phe Glu Ala Arg 
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420 



Ala 


Leu Val 


Arg Asn Pro 


Lys He 




435 




440 


Ala 


Leu Asp 


Thr Glu Ser 


Glu Ala 




450 




455 


Ala 


Arg Lys 


Gly Arg Thr 


Thr He 


465 




470 




Val 


Arg Asn 


Ala Asp Val 


He Ala 






485 




Glu 


Lys Gly 


Asn His Asp 


Glu Leu 






500 




Lys 


Leu Val 


Thr Met Gin 


Thr Ala 




515 




520 


Ala 


Ala Ala 


Arg Ala Leu 


Val Arg 




530 




535 


Glu 


Ala Thr 


Ser Ala Leu 


Asp Thr 


545 




550 




Ala 


Leu Asp 


Lys Ala Arg 


Glu Gly 






565 




Arg 


Leu Ser 


Thr Val Arg 


Asn Ala 






580 




Gly 


Val He 


Val Glu Gin 


Gly Asn 




595 




600 


Gly 


He Tyr 


Phe Lys Leu 


Val Met 




610 




615 


Glu 


Pro Gly 


Asn Asn Ala 


Val Glu 


625 




630 




Asp 


Val Ala 


Pro Asn Gly 


Trp Lys 






645 




Lys 


Lys Ser 


Leu Lys Ser 


Pro His 






660 




Asn 


Glu Leu 


Asp Ala Asn 


Val Pro 




675 




680 


Lys 


Leu Asn 


Lys Thr Glu 


Trp Pro 




690 




695 


Ala 


lie Ala 


Asn Gly Ala 


Leu Gin 


705 




710 




Glu 


Met He 


Ala He Phe 


Gly Pro 






725 


Lys 


Cys Asn 


Met Phe Ser 


Leu Val 






740 




Phe 


Phe Thr 


Phe Phe Leu 


Gin Gly 




755 




760 


lie 


Leu Thr 


Thr Arg Leu 


Arg Ser 




770 




775 


Gin 


Asp Met 


Ser Trp Phe 


Asp Asp 


785 




790 




Ser 


Thr Arg 


Leu Ala Thr 


Asp Ala 






805 




Thr 


Lys Leu 


Ala Leu He 


Ala Gin 






820 




lie 


lie He 


Ser Phe He- 


Tyr Gly 




835 




840 


Ser 


Val Val 


Pro Phe He 


Ala Val 




850 




855 


Leu 


Ala Gly 


Asn Ala Lys 


Arg Asp 


865 




870 




Lys 


He Ala 


Thr Glu Ala 


He Glu 






885 




Thr 


Gin Glu 


Arg Lys Phe 


Glu Ser 






900 




Pro 


Tyr Arg 


Asn Ser Val 


Arg Lys 



14. 



425 






430 






Leu 


Leu Leu 


Asp Glu 


Ala 


Thr 


Ser 






445 








Val 


Val Gin 


Val Ala 


Leu 


Asp 


Lys 






460 








Val 


He Ala 


His Arg 


Leu 


Ser 


Thr 




475 








480 


Gly 


Phe Asp 


Asp Gly 


Val 


He 


Val 




490 






495 




Met 


Lys Glu 


Lys Gly 


He 


Tyr 


Phe 


505 






510 






Gly 


Asn Glu 


Val Glu 


Leu 


Glu 


Asn 




525 








Asn 


Pro Lys 


He Leu 


Leu 


Leu 


Asp 






540 








Glu 


Ser Glu 


Ala Val 


Val 


Gin 


Ala 




555 








560 


Arg 


Thr Thr 


He Val 


He 


Ala 


His 


570 






575 




Asp 


Val He 


Ala Gly 


Phe 


Asp 


Gly 


585 






590 






His 


Asp Glu 


Leu Met 


Arg 


Glu 


Lys 






605 








Thr 


Gin Thr 


Arg Gly 


Asn 


Glu 


He 






620 








Leu 


Ser Asp 


Glu Lys 


Ala 


Ala 


Gly 




635 








640 


Ala 


Arg He 


Phe Arg 


Asn 


Ser 


Thr 




650 






655 




Gin 


Asn Arg 


Leu Asp 


Glu 


Glu 


Thr 


665 






670 






Pro 


Val Ser 


Phe Leu 


Lys 


Val 


Leu 






685 






Tyr 


Phe Val 


Val Gly 


Thr 


Val 


Cys 






700 








Pro 


Ala Phe 


Ser lie 


He 


Leu 


Ser 




715 








720 


Gly 


Asp Asp 


Ala Val 


Lys 


Gin 


Gin 




730 






735 




Phe 


Leu Gly 


Leu Gly 


Val 


Leu 


Ser 


745 






750 






Phe 


Thr Phe 


Gly Lys 


Ala 


Gly 


Glu 






765 








Met 


Ala Phe 


Lys Ala 


Met 


Leu 


Arg 






780 








His 


Lys Asn 


Ser Thr 


Gly 


Ala 


Leu 




795 








800 


Ala 


Gin Val 


Gin Gly 


Ala 


Thr 


Gly 




810 






815 




Asn 


Thr Ala 


Asn Leu 


Gly 


Thr 


Gly 


825 






830 






Trp 


Gin Leu 


Thr Leu 


Leu 


Leu 


Leu 




845 








Ala 


Gly He 


Val Glu 


Met 


Lys 


Met 




860 








Lys 


Lys Glu 


Met Glu 


Ala 


Ala 


Gly 




875 








880 


Asn 


He Arg 


Thr Val 


Val 


Ser 


Leu 




890 






895 




Met 


Tyr Val 


Glu Lys 


Leu 


His 


Gly 


905 






910 






Ala 


His lie 


Tyr Gly 


He 


Thr 


Phe 
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15. ~~ 

915 920 925 

Ser He Ser Gin Ala Phe Met Tyr Phe Ser Tyr Ala Gly Cys Phe Arg 

930 935 940 

Phe Gly Ser Tyr Leu He Val Asn Gly His Met Arg Phe Lys Asp Val 
945 950 955 960 

He Leu Val Phe Ser Ala He Val Leu Gly Ala Val Ala Leu Gly His 

965 970 975 

Ala Ser Ser Phe Ala Pro Asp Tyr Ala Lys Ala Lys Leu Ser Ala Ala 

980 985 990 

Tyr Leu Phe Ser Leu Phe Glu Arg Gin Pro Leu He Asp Ser Tyr Ser 

995 1000 1005 

Gly Glu Gly Leu Trp Pro Asp Lys Phe Glu Gly Ser Val Thr Phe Asn 

1010 1015 1020 

Glu Val Val Phe Asn Tyr Pro Thr Arg Ala Asn Val Pro Val Leu Gin 
1025 1030 1035 1040 

Gly Leu Ser Leu Glu Val Lys Lys Gly Gin Thr Leu Ala Leu Val Gly 

1045 1050 1055 

Ser Ser Gly Cys Gly Lys Ser Thr Val Val Gin Leu Leu Glu Arg Phe 

1060 1065 1070 

Tyr Asp Pro Met Ala Gly Ser Val Leu Leu Asp Gly Gin Glu Ala Lys 

1075 1080 1085 

Lys Leu Asn Val Gin Trp Leu Arg Ala Gin Leu Gly He Val Ser Gin 

1090 * 1095 1100 

Glu Pro He Leu Phe Asp Cys Ser He Ala Glu Asn He Ala Tyr Gly 
1105 1110 1115 1120 

Asp Asn Ser Arg Val Val Pro His Asp Glu He Val Arg Ala Ala Lys 

1125 1130 1135 

Glu Ala Asn He His Pro Phe He Glu Thr Leu Pro Gin Lys Tyr Asn 

1140 1145 1150 

Thr Arg Val Gly Asp Lys Gly Thr Gin Leu Ser Gly Gly Gin Lys Gin 

1155 1160 1165 

Arg He Ala He Ala Arg Ala Leu He Arg Gin Pro Arg Val Leu Leu 

1170 1175 1180 

Leu Asp Glu Ala Thr Ser Ala Leu Asp Thr Glu Ser Glu Lys Val Val 
1185 1190 1195 1200 

Gin Glu Ala Leu Asp Lys Ala Arg Glu Gly Arg Thr Cys He Val He 

1205 1210 1215 

Ala His Arg Leu Ser Thr He Gin Asn Ala Asp Leu He Val Val He 

1220 1225 1230 

Glu Asn Gly Lys Val Lys Glu His Gly Thr His Gin Gin Leu Leu Ala 

1235 1240 1245 

Gin Lys Gly He Tyr Phe Ser Met Val Asn lie Gin Ala Gly Thr Gin 

1250 1255 1260 

Asn Leu 
1265 

<210> 7 
<211> 1207 
<212> PRT 

<213> Arabidopsis thaliana 
<300> 

<308> Genbank A42150 
<309> 1997-03-13 

<400> 7 

Glu Lys Met Met Glu Glu Val Leu Lys Tyr Ala Leu Tyr Phe Leu Val 

1 5 10 15 

Val Gly Ala Ala He Trp Ala Ser Ser Trp Ala Glu He Ser Cys Trp 

20 25 30 

Met Trp Ser Gly Glu Arg Gin Thr Thr Lys Met Arg He Lys Tyr Leu 
35 40 45 
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Glu Ala Ala 


Leu Asn Gin Asp 


He Gin Phe 


Phe Asp 


Thr 


Glu 


Val 


Arg 


50 


55 




60 










Thr Ser Asp 


Val Val Phe Ala 


He Asn Thr 


Asp Ala 


Val 


Met 


Val 


Gin 


65 


70 




75 








80 


Asp Ala lie 


Ser Glu Lys Leu 


Gly Asn Phe 


He His 


Tyr 


Met 


Ala 


Thr 


85 


90 








95 




Phe Val Ser 


Gly Phe lie Val 


Gly Phe Thr 


Ala Val 


Trp 


Gin 


Leu 


Ala 




100 


105 






110 






Leu Val Thr 


Leu Ala Val Val 


Pro Leu He 


Ala Val 


He 


Gly 


Gly 


He 


115 




120 




125 








His Thr Thr 


Thr Leu Ser Lys 


Leu Ser Asn 


Lys Ser 


Gin 


Glu 


Ser 


Leu 


130 


135 




140 










Ser Gin Ala 


Gly Asn He Val 


Glu Gin Thr 


Val Val 


Gin 


He 


Arg 


Val 


145 


150 




155 








160 


Val Met Ala 


Phe Val Gly Glu 


Ser Arg Ala 


Ser Gin 


Ala 


Tyr 


Ser 


Ser 




165 


170 








175 




Ala Leu Lys 


He Ala Gin Lys 


Leu Gly Tyr 


Lys Thr 


Gly 


Leu 


Ala 


Lys 




180 


185 






190 






Gly Met Gly 


Leu Gly Ala Thr 


Tyr Phe Val 


Val Phe 


Cys 


Cys 


Tyr 


Ala 


195 




200 




205 








Leu Leu Leu 


Trp Tyr Gly Gly 


Tyr Leu Val 


Arg His 


His 


Leu 


Thr 


Asn 


210 


215 




220 










Gly Gly Leu 


Ala He Ala Thr 


Met Phe Ala 


Val Met 


He 


Gly 


Gly 


Leu 


225 


230 




235 








240 


Ala Leu Gly 


Gin Ser Ala Pro 


Ser Met Ala 


Ala Phe 


Ala 


Lys 


Ala 


Lys 




245 


250 








255 




Val Ala Ala 


Ala Lys He Phe 


Arg He He 


Asp His 


Lys 


Pro 


Thr 


He 




260 


265 






270 






Glu Arg Asn 


Ser Glu Ser Gly 


Val Glu Leu 


Asp Ser 


Val 


Thr 


Gly 


Leu 


275 




280 




285 








Val Glu Leu 


Lys Asn- Val Asp 


Phe Ser Tyr 


Pro Ser 


Arg 


Pro 


Asp 


Val 


290 


295 




300 










Lys lie Leu 


Asn Asn Phe Cys 


Leu Ser Val 


Pro Ala 


Gly 


Lys 


Thr 


He 


305 


310 




315 








320 


Ala Leu Val 


Gly Ser Ser Gly 


Ser Gly Lys 


Ser Thr 


Val 


Val 


Ser 


Leu 




325 


330 








335 




lie Glu Arg 


Phe Tyr Asp Pro 


Asn Ser Gly 


Gin Val 


Leu 


Leu 


Asp 


Gly 




340 


345 






350 






Gin Asp Leu 


Lys Thr Leu Lys 


Leu Arg Trp 


Leu Arg 


Gin 


Gin 


He 


Gly 


355 




360 




365 








Leu Val Ser 


Gin Glu Pro Ala 


Leu Phe Ala 


Thr Ser 


He 


Lys 


Glu 


Asn 


370 


375 




380 










lie Leu Leu 


Gly Arg Pro Asp Ala Asp Gin 


Val Glu 


He 


Glu 


Glu 


Ala 


385 


390 




395 








400 


Ala Arg Val 


Ala Asn Ala His Ser Phe He 


He Lys 


Leu 


Pro 


Asp 


Gly 




405 


410 








415 




Phe Asp Thr 


Gin Val Gly Glu Arg Gly Leu 


Gin Leu 


Ser 


Gly 


Gly 


Gin 




420 


425 






430 






Lys Gin Arg 


He Ala lie, Ala Arg Ala Met 


Leu Lys 


Asn 


Pro 


Ala 


lie 


435 




440 




445 








Leu Leu Leu 


Asp Glu Ala Thr 


Ser Ala Leu 


Asp Ser 


Glu 


Ser 


Glu 


Lys 


450 


455 




460 










Leu Val Gin 


Glu Ala Leu Asp Arg Phe Met 


He Gly 


Arg 


Thr 


Thr 


Leu 


465 


470 




475 








480 


lie lie Ala 


His Arg Leu Ser Thr He Arg 


Lys Ala 


Asp 


Leu 


Val 


Ala 




485 


490 








495 




Val Leu Gin 


Gin Gly Ser Val Ser Glu He 


Gly Thr 


His 


Asp 


Glu 


Leu 




500 


505 




510 






Phe Ser Lys 


Gly Glu Asn Gly Val Tyr Ala 


Lys Leu 


He 


Lys 


Met 


Gin. 


515 




520 




525 








Glu Ala Ala 


His Glu Thr Ala 


Met Ser Asn 


Ala Arg 


Lys 


Ser 


Ser 


Ala 


530 


535 




540 
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Arg Pro Ser Ser Ala Arg Asn Ser Val Ser Ser Pro lie Met Thr Arg 
545 550 555 560 

Asn Ser Ser Tyr Gly Arg Ser Pro Tyr Ser Arg Arg Leu Ser Asp Phe 

565 " 570 575 

Ser Thr Ser Asp Phe Ser Leu Ser lie Asp Ala Ser Ser Tyr Pro Asn 

580 585 590 

Tyr Arg Asn Glu Lys Leu Ala Phe Lys Asp Gin Ala Asn Ser Phe Trp 

595 600 605 

Arg Leu Ala Lys Met Asn Ser Pro Glu Trp Lys Tyr Ala Leu Leu Gly 

610 ~ 615 620 

Ser Val Gly Ser Val lie Cys Gly Ser Leu Ser Ala Phe Phe Ala Tyr 
625 630 635 640 

Val Leu Ser Ala Val Leu Ser Val Tyr Tyr Asn Pro Asp His Glu Tyr 

645 650 655 

Met lie Lys Gin He Asp Lys Tyr Cys Tyr Leu Leu lie Gly Leu Ser 

660 665 670 

Ser Ala Ala Leu Val Phe Asn Thr Leu Gin His Ser Phe Trp Asp He 

675 680 685 

Val Gly Glu Asn Leu Thr Lys Arg Val Arg Glu Lys Met Leu Ser Ala 

690 695 700 

Val Leu Lys Asn Glu Met Ala Trp Phe Asp Gin Glu Glu Asn Glu Ser 
705 710 715 720 

Ala Arg lie Ala Ala Arg Leu Ala Leu Asp Ala Asn Asn Val Arg Ser 

725 730 735 

Ala He Gly Asp Arg He Ser Val He Val Gin Asn Thr Ala Leu Met 

740 745 750 

Leu Val Ala Cys Thr Ala Gly Phe Val Leu Gin Trp Arg Leu Ala Leu 

755 760 765 

Val Leu Val Ala Val Phe Pro Val Val Val Ala Ala Thr Val Leu Gin 

770 775 780 

Lys Met Phe Met Thr Gly Phe Ser Gly Asp Leu Glu Ala Ala His Ala 
785 790 795 800 

Lys Gly Thr Gin Leu Ala Gly Glu Ala He Ala Asn Val Arg Thr Val 

805 810 815 

Ala Ala Phe Asn Ser Glu Ala Lys He Val Arg Leu Tyr Thr Ala Asn 

820 " 825 830 

Leu Glu Pro Pro Leu Lys Arg Cys Phe Trp Lys Gly Gin He Ala Gly 

835 840 845 

Ser Gly Tyr Gly Val Ala Gin Phe Cys Leu Tyr Ala Ser Tyr Ala Leu 

850 855 860 

Gly Leu Trp Tyr Ala Ser Trp Leu Val Lys His Gly He Ser Asp Phe 
865 870 875 880 

Ser Lys Thr He Arg Val Phe Met Val Leu Met Val Ser Ala Asn Gly 

885 890 895 

Ala Ala Glu Thr Leu Thr Leu Ala Pro Asp Phe He Lys Gly Gly Gin 

900 905 . 910 

Ma Met Arg Ser Val Phe Glu Leu Leu Asp Arg Lys Thr Glu He Glu 

915 920 925 

Pro Asp Asp Pro Asp Thr Thr Pro Val Pro Asp Arg Leu Arg Gly Glu 

930 935 940 

Val Glu Leu Lys His He Asp Phe Ser Tyr Pro Ser Arg Pro Asp He 
945 950 955 960 

Gin He Phe Arg Asp Leu Ser Leu Arg Ala Arg Ala Gly Lys Thr Leu 

965 970 975 

Ala Leu Val Gly Pro Ser Gly Cys Gly Lys Ser Ser Val He Ser Leu 

980 " 985 990 

He Gin Arg Phe Tyr Glu Pro Ser Ser Gly Arg Val Met He Asp Gly 

995 1000 1005 

Lys Asp He Arg Lys Tyr Asn Leu Lys Ala He Arg Lys His He Ala 

1010 1015 1020 

He Val Pro Gin Glu Pro Cys Leu Phe Gly Thr Thr He Tyr Glu Asn 
1025 1030 1035 1040 
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He Ala Tyr Gly His Glu Cys Ala Thr Glu Ala Glu He He Gin Ala 

1045 1050 1055 

Ala Thr Leu Ala Ser Ala His Lys Phe He Ser Ala Leu Pro Glu Gly 

1060 1065 1070 

Tyr Lys Thr Tyr Val Gly Glu Arg Gly Val Gin Leu Ser Gly Gly Gin 

1075 1080 1085 

Lys Gin Arg lie Ala He Ala Arg Ala Leu Val Arg Lys Ala Glu He 

1090 1095 1100 

Met Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp Ala Glu Ser Glu Arg 
1105 * 1110 1115 1120 

Ser Val Gin Glu Ala Leu Asp Gin Ala Cys Ser Gly Arg Thr Ser He 

1125 1130 1135 

Val Val Ala His Arg Leu Ser Thr He Arg Asn Ala His Val He Ala 

. 1140 . 1145 1150 

Val He Asp Asp Gly Lys Val Ala Glu Gin Gly Ser His Ser His Leu 

1155 1160 1165 

Leu Lys Asn His Pro Asp Gly He Tyr Ala Arg Met He Gin Leu Gin 

1170 1175 1180 

Arg Phe Thr His Thr Gin Val He Gly Met Thr Ser Gly Ser Ser Ser 
1185 1190 1195 1200 

Arg Val Lys Glu Asp Asp Ala 
1205 

<210> 8 
<211> 1161 
<212> PRT 

<213> Arabidopsis thaliana 
<300> 

<308> Genbank CAA71277 
<309> 1997-05-19 



<400> 8 



Lys Gin Ala 


Ser 


His 


Arg 


Val 


Ala 


Lys 


Tyr 


Ser 


Leu 


Asp Phe Val Tyr 


1 




5 










10 






15 


Leu Ser Val 


Ala 


He 


Leu 


Phe 


Ser 


Ser 


Trp 


Leu 


Glu 


Val Ala Cys Trp 




20 










25 








30 


Met His Thr 


Gly 


Glu 


Arg 


Gin 


Ala 


Ala 


Lys 


Met 


Arg 


Arg Ala Tyr Leu 


35 










40 










45 


Arg Ser Met 


Leu 


Ser 


Gin 


Asp 


He 


Ser 


Leu 


Phe 


Asp 


Thr Glu Ala Ser 


50 








55 










60 




Thr Gly Glu 


Val 


He 


Ser 


Ala 


He 


Thr 


Ser Asp 


He 


Leu Val Val Gin 


65 






70 










75 




80 


Asp Ala Leu 


Ser 


Glu 


Lys 


Val 


Gly 


Asn 


Phe 


Leu 


His 


Tyr He Ser Arg 






85 










90 






95 


Phe He Ala 


Gly 


Phe 


Ala 


He 


Gly 


Phe 


Thr 


Ser 


Val 


Trp Gin He Ser 




100 










105 








110 


Leu Val Thr 


Leu 


Ser 


He 


Val 


Pro 


Leu 


He 


Ala 


Leu 


Ala Gly Gly He 


115 










120 










125 


Tyr Ala Phe 


Val 


Ala 


He 


Gly 


Leu 


He 


Ala Arg 


Val 


Arg Lys Ser Tyr 


130 








135 










140 




He Lys Ala 


Gly 


Glu 


He 


Ala 


Glu 


Glu 


Val 


He 


Gly 


Asn Val Arg Thr 


145 






150 










155 




160 


Val Gin Ala 


Phe 


Thr 


Gly 


Glu 


Glu 


Arg 


Ala 


Val 


Arg 


Leu Tyr Arg Glu 






165 










170 






175 


Ala Leu Glu 


Asn 


Thr 


Tyr 


Lys 


Tyr 


Gly 


Arg 


Lys 


Ala 


Gly Leu Thr Lys 




180 










185 








190 


Gly Leu Gly 


Leu 


Gly 


Ser 


Met 


His 


Cys 


Val 


Leu 


Phe 


Leu Ser Trp Ala 


195 










200 










205 


Leu Leu Val 


Trp 


Phe 


Thr 


Ser 


Val 


Val 


Val 


His 


Lys 


Asp He Ala Asp 


210 








215 










220 




Gly Gly Lys 


Ser 


Phe 


Thr 


Thr 


Met 


Leu 


Asn 


Val 


Val 


He Ala Gly Leu 



5 
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225 








230 










235 










240 


Ser 


Leu 


Gly 


Gin Ala 


Ala 


Pro 


Asp 


He 


Ser 


Ala 


Phe 


Val 


Arg 


Ala 


Lys 








245 










250 










255 




Ala 


Ala 


Ala 


Tyr Pro 


He 


Phe 


Lys 


Met 


He 


Glu 


Arg 


Asn 


Thr 


Val 


Thr 








260 








265 










270 






Lys 


Thr 


Ser 


Ala Lys 


Ser 


Gly 


Arg 


Lys 


Leu 


Gly 


Lys 


Val 


Asp 


Gly 


His 






275 








280 










285 








lie 


Gin 


Phe 


Lys Asp 


Ala 


Thr 


Phe 


Ser 


Tyr 


Pro 


Ser 


Arg 


Pro 


Asp 


Val 




290 








295 










300 










Val 


He 


Phe 


Asp Arg 


Leu 


Asn 


Leu 


Ala 


He 


Pro 


Ala 


Gly 


Lys 


He 


Val 


305 








310 










315 










320 


Ala 


Leu 


Val 


Gly Gly 


Ser 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 


Val 


He 


Ser 


Leu 








325 










330 










335 




He 


Glu 


Arg 


Phe Tyr 


Glu 


Pro 


He 


Ser 


Gly 


Ala 


Val 


Leu 


Leu 


Asp 


Gly 








340 








345 










350 






Asn 


Asn 


He 


Ser Glu 


Leu 


Asp 


He 


Lys 


Trp 


Leu 


Arg 


Gly 


Gin 


He 


Gly 






355 








360 










365 








Leu 


Val 


Asn 


Gin Glu 


Pro 


Ala 


Leu 


Phe 


Ala 


Thr 


Thr 


He 


Arg 


Glu 


Asn 




370 








375 










380 










lie 


Leu 


Tyr 


Gly Lys 


Asp 


Asp 


Ala 


Thr 


Ala 


Glu 


Glu 


He 


Thr 


Arg 


Ala 


385 








390 










395 










400 


Ala 


Lys 


Leu 


Ser Glu 


Ala 


He 


Ser 


Phe 


He 


Asn 


Asn 


Leu 


Pro 


Glu 


Gly 






405 










410 










415 




Phe 


Glu 


Thr 


Gin Val 


Gly 


Glu 


Arg 


Gly 


He 


Gin 


Leu 


Ser 


Gly 


Gly 


Gin 








420 








425 










430 






Lys 


Gin 


Arg 


He Ala 


He 


Ser 


Arg 


Ala 


lie 


Val 


Lys 


Asn 


Pro 


Ser 


He 






435 








440 










445 








Leu 


Leu 


Leu 


Asp Glu 


Ala 


Thr 


Ser 


Ala 


Leu 


Asp 


Ala 


Glu 


Ser 


Glu 


Lys 




450 








455 










460 










Ser 


Val 


Gin 


Glu Ala 


Leu 


Asp 


Arg 


Val 


Met 


Val 


Gly 


Arg 


Thr 


Thr 


Val 


465 








470 










475 










480 


Val 


Val 


Ala 


His Arg 


Leu 


Ser 


Thr 


Val 


Arg 


Asn 


Ala 


Asp 


He 


He 


Ala 








485 










490 










495 




Val 


Val 


His 


Glu Gly 


Lys 


He 


Val 


Glu 


Phe 


Gly 


Asn 


His 


Glu 


Asn 


Leu 








500 






505 








510 






He 


Ser 


Asn 


Pro Asp 


Gly 


Ala 


Tyr 


Ser 


Ser 


Leu 


Leu 


Arg 


Leu 


Gin 


Glu 






515 








520 










525 








Thr 


Ala 


Ser 


Leu Gin 


Arg 


Asn 


Pro 


Ser 


Leu 


Asn 


Arg 


Thr 


Leu 


Ser 


Arg 




530 








535 










540 










Pro 


His 


Ser 


He Lys 


Tyr 


Ser 


Arg 


Glu 


Leu 


Ser 


Arg 


Thr 


Arg 


Ser 


Ser 


545 








550 










555 










560 


Phe 


Cys 


Ser 


Glu Arg 


Glu 


Ser 


Val 


Thr 


Arg 


Pro 


Asp 


Gly 


Ala 


Asp 


Pro 








565 










570 










575 




Ser 


Lys 


Lys 


Val Lys 


Val 


Thr 


Val 


Gly 


Arg 


Leu 


Tyr 


Ser 


Met 


He 


Arg 








580 








585 










590 






Pro 


Asp 


Trp 


Met Tyr 


Gly 


Val 


Cys 


Gly 


Thr 


He 


Cys 


Ala 


Phe 


lie 


Ala 






595 








600 










605 








Gly 


Ser 


Gin Met Pro 


Leu 


Phe 


Ala 


Leu 


Gly 


Val 


Ser 


Gin 


Ala 


Leu 


Val 




610 








615 








620 










Ser 


Tyr 


Tyr Ser Gly 


Trp 


Asp 


Glu 


Thr 


Gin 


Lys 


Glu 


He 


Lys 


Lys 


He 


625 








630 










635 










640 


Ala 


He 


Leu 


Phe Cys 


Cys 


Ala 


Ser 


Val 


He 


Thr 


Leu 


He 


Val 


Tyr 


Thr 








645 










650 










655 




He 


Glu 


His 


He Cys 


Phe 


Gly 


Thr 


Met 


Gly 


Glu 


Arg 


Leu 


Thr 


Leu 


Arg 








660 








665 










670 






Val 


Arg 


Glu 


Asn Met 


Phe 


Arg 


Ala 


He 


Leu 


Lys 


Asn 


Glu 


He 


Gly 


Trp 






675 








680 










685 








Phe 


Asp 


Glu Val Asp 


Asn 


Thr 


Ser 


Ser 


Met 


Leu 


Ala 


Ser 


Arg 


Leu 


Glu 




690 








695 










700 










Ser 


Asp 


Ala 


Thr Leu 


Leu 


Lys 


Thr 


He 


Val 


Val 


Asp 


Arg 


Ser 


Thr 


He 


705 








710 










715 










720 


Leu 


Leu 


Gin 


Asn Leu 


Gly 


Leu 


Val 


Val 


Thr 


Ser 


Phe 


He 


He 


Ala 


Phe 
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725 










730 


735 




lie Leu 


Asn 


Trp 
740 


Arg 


Leu 


Thr 


Leu 


Val 
745 


Val Leu Ala Thr Tyr 
750 


Pro 


Leu 


Val He 


Ser 


Gly 


His 


He 


Ser 


Glu 


Lys 


Leu Phe Met Gin Gly 


Tyr Gly 




755 








760 




765 






Gly Asp 


Leu 


Asn 


Lys 


Ala 


Tyr 


Leu 


Lys 


Ala Asn Met Leu Ala 


Gly Glu 


770 










775 






780 






Ser Val 


Ser 


Asn 


He 


Arg 


Thr 


Val 


Ala 


Ala Phe Cys Ala Glu 


Glu Lys 


785 






790 








795 




800 


lie Leu 


Glu 


Leu 


Tyr 
805 


Ser 


Arg 


Glu 


Leu 


Leu Glu Pro Ser Lys 
810 


Ser 
815 


Ser 


Phe Arg 


Arg 


Gly 
820 


Gin 


He 


Ala 


Gly 


Leu 
825 


Phe Tyr Gly Val Ser 
830 


Gin 


Phe 


Phe He 


Phe 
835 


Ser 


Ser 


Tyr 


Gly 


Leu 
840 


Ala 


Leu Trp Tyr Gly Ser 
845 


Thr 


Leu 


Met Asp 


Lys 


Gly 


Leu 


Ala 


Gly 


Phe 


Lys 


Ser Val Met Lys Thr 


Phe 


Met 


850 










855 






860 






Val Leu 


lie. 


Val 


Thr 


Ala 


Leu 


Ala 


Met 


Gly Glu Thr Leu Ala 


Leu 


Ala 


865 








870 








875 




880 


Pro Asp 


Leu 


Leu 


Lys 


Gly 


Asn 


Gin 


Met 


Val Ala Ser Val Phe 


Glu 


He 






885 










890 


895 




Leu Asp 


Arg Lys 


Thr 


Gin 


He 


Val 


Gly 


Glu Thr Ser Glu Glu 


Leu 


Asn 




900 










905 


910 






Asn Val 


Glu Gly 


Thr 


He 


Glu 


Leu 


Lys 


Gly Val His Phe Ser 


Tyr 


Pro 




915 










920 




925 






Ser Arg 


Pro Asp 


Val 


Val 


He 


Phe 


Arg 


Asp Phe Asp Leu He 


Val Arg 


930 










935 






940 






Ala Gly 


Lys Ser 


Met 


Ala 


Leu 


Val 


Gly 


Gin Ser Gly Ser Gly 


Lys 


Ser 


945 








950 








955 




960 


Ser Val 


He 


Ser 


Leu 


He 


Leu 


Arg 


Phe 


Tyr Asp Pro Thr Ala 


Gly Lys 








965 










970 


975 




Val Met 


He 


Glu 
980 


Gly 


Lys 


Asp 


He 


Lys 

.985 


Lys Leu Asp Leu Lys 
990 


Ala 


Leu 


Arg Lys 


His He 


Gly 


Leu 


Val 


Gin 


Gin 


Glu Pro Ala Leu Phe 


Ala 


Thr 




995 










1000 


1005 






Thr He 


Tyr Glu 


Asn 


lie 


Leu 


Tyr 


Gly 


Asn Glu Gly Ala Ser 


Gin 


Ser 


1010 








1015 




1020 






Glu Val 


Val 


Glu 


Ser 


Ala 


Met 


Leu 


Ala 


Asn Ala His Ser Phe 


He 


Thr 


1025 








1030 






1035 




1040 


Ser Leu 


Pro 


Glu 


Gly 


Tyr 


Ser 


Thr 


Lys 


Val Gly Glu Arg Gly 


Val 


Gin 



1045 1050 1055 

. Met Ser Gly Gly Gin Arg Gin Arg He Ala He Ala Arg Ala lie Leu 



1060 1065 1070 

Lys Asn Pro Ala He Leu Leu Leu Asp Glu Ala Thr Ser Ala Leu Asp 

1075 1080 1085 

Val Glu Ser Glu Arg Val Val Gin Gin Ala Leu Asp Arg Leu Met Ala 

1090 1095 1100 

Asn Arg Thr Thr Val Val Val Ala His Arg Leu Ser Thr He Lys Asn 
1105 1110 1115 1120 

Ala Asp Thr He Ser Val Leu His Gly Gly Lys He Val Glu Gin Gly 

1125 1130 1135 

Ser His Arg Lys Leu Val Leu Asn Lys Ser Gly Pro Tyr Phe Lys Leu 

1140 1145 " 1150 

He Ser Leu Gin Gin Gin Gin Gin Pro 
1155 1160 

<210> 9 
<211> 986 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> synthetic consensus sequence 
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Met Asp 


Glu 


Gly 


Ala 


Leu Ser Asp 


Arg 


Lys 


Lys 


Lys Val Gly Val Leu 


1 








5 






10 




15 


Phe Arg 


Tyr 


Ala 


Asp 


Trp Asp Lys 


Leu 


Met 


Leu 


Gly Thr Leu Ala Ala 








20 






25 






30 


He 


He 


His 


Gly 


Ser 


Leu Pro Leu 


Met 


Met 


He 


Val Phe Gly Glu Met 






35 




40 








45 


Thr Asp 


Phe 


Ala 


Ser 


Lys Leu Glu 


Glu 


Glu 


Met 


Thr Arg Tyr Ala Tyr 




50 








55 








60 


Tyr Tyr 


Ser 


Gly 


Leii 


Gly Ala Gly 


Val 


Leu 


Val 


Ala Tyr He Gin Val 


65 










70 






75 


80 


Ser 


Trp 


Leu 


Ala 


Ala 


Gly Arg Gin 


He 


Arg 


Lys 


He Arg Lys Phe Phe 










85 






90 




95 


His 


Ala 


He 


Leu 


Arg 


Gin Glu lie 


Gly 


Trp 


Phe 


Asp He Thr Gly Glu 








100 






105 






110 


Leu 


Asn 


Thr 


Arg 


Leu 


Thr Asp Asp 


He 


Ser 


Lys 


He Asn Asp Gly He 






115 






120 








125 


Gly Asp 


Lys 


Val 


Gly 


Met Phe Phe 


Gin 


Val 


Ala 


Thr Phe Leu Ala Gly 




130 








135 








140 


Phe 


He 


Val 


Gly 


Phe 


He Gly Trp 


Lys 


Leu 


Thr 


Leu Val He Leu Ala 


145 








150 




155 


160 


He 


Ser 


Pro 


He 


lie 


Gly Leu Ser 


Ala 


Ala 


Val 


Trp Ala Lys He Leu 








•• 


165 




170 




175 


Ser 


Phe 


Ser 


Lys 


Glu 


Leu Ala Tyr 


Ala 


Lys 


Ala 


Gly Ala Val Ala Glu 








180 






185 






190 


Glu 


Leu 


Gly 


Ala 


He 


Arg Thr Val 


He 


Ala 


Phe 


Gly Gly Gin Lys Glu 






195 






200 








205 


Leu 


Glu 


Arg 


Tyr 


Gin 


Lys Leu Glu 


Ala 


Lys 


Lys 


He Gly He Lys Lys 




210 








215 








220 


Ala 


He 


Ser 


Ala 


He 


Ser Met Gly 


Ala 


Phe 


Leu 


Leu He. Tyr Ala Ser 


225 










230 






235 


240 


Tyr 


Ala 


Leu 


Ala 


Phe 


Trp Tyr Gly 


Ser 


Thr 


Leu 


Val He Ser Glu Tyr 










245 




250 




255 


Thr 


lie 


Gly 


Ala 


Met 


Thr Val Phe 


Phe 


Ser 


He 


Leu He Gly Ala Phe 






260 






265 






270 


Ser 


Val 


Gly 


Gin 


Ala 


Ala Pro He 


Asp 


Ala 


Phe 


Ala Asn Ala Arg Gly 






275 






280 








285 


Ala 


Ala 


Tyr 


lie- 


Phe 


Lys He He 


Asp 


Asn 


Pro 


Ser He Asp Ser Phe 




290 








295 








300 


Ser 


Gly 


His 


Lys 


Pro 


Asp He Lys 


Gly 


Asn 


Leu 


Glu Phe Lys Asp Val 


305 










310 






315 


320 


His 


Phe 


Ser 


Tyr Pro 


Ser Arg Glu 


Val 


Lys 


lie 


Leu Lys Gly Leu Asn 










325 






330 




335 


Leu 


Lys 


Val 


Ser Gly 


Gin Thr Val 


Ala 


Leu 


Val 


Gly Ser Gly Cys Gly 








340 






345 






350 


Lys 


Ser 


Tnr 


Thr 


Val 


Gin Leu lie 


Gin 


Arg 


Leu 


Tyr Asp Pro Glu Gly 






355 






360 








365 


Val 


He 


Asp 


Gly Gin 


Asp He Arg 


Thr 


He 


Asn 


Val Arg Tyr Leu Arg 




370 








375 








380 


Glu 


He 


He 


Gly Val 


Val Ser. Gin 


Glu 


Pro 


Val 


Leu Phe Ala Thr Thr 


385 










390 






395 


400 


lie 


Ala 


Glu 


Asn 


He 


Tyr Gly Arg 


Asp 


Val 


Thr* 


Met Asp Glu He Glu 










405 






410 




415 


Lys 


Ala 


Val 


Lys 


Glu 


Ala Asn Ala 


Tyr 


Glu 


Phe 


He Met Lys Leu Pro 








420 






425 






430 


Phe 


Asp 


Thr 


Leu 


Val 


Gly Glu Arg 


Gly 


Ala 


Gin 


Leu Ser Gly Gly Gin 






435 






440 








445 


Lys 


Gin 


Arg 


He 


Ala 


He Ala Arg 


Ala 


Leu 


Val 


Arg Asn Pro Lys lie 




450 








455 








460 


Leu 


Leu 


Leu 


Asp Glu 


Ala Thr Ser 


Ala 


Leu 


Asp 


Thr Glu Ser Glu Ala 
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.Val 


Val 


Gin 


Ala 


Ala 


His 


Arg 


Leu 








500 


Glu 


Asp 


Gly 


Val 






515 




Gly 


Val 


Tyr 


Phe 




530 






Met 


Lys 


Ser 


Leu 


545 








Val 


Ser 


Phe 


Trp 


Val 


Gly 


Thr 


Val 








580 


lie 


Leu 


Ser 


He 






595 




Ala 


Val 


Val 


Pro 




610 






Leu 


Gly 


Asn 


Ala 


625 








Glu 


Ala 


lie 


Glu 


Glu 


Ser 


Met 


Tyr 








660 


Tyr 


Gly 


lie 


Thr 






675 




Gly 


Cys 


Phe 


Arg 




690 






lie 


Leu 


Val 


Phe 


705 








Ser 


Ser 


Phe 


Ala 


Phe 


Leu 


lie 


Glu 








740 


Leu 


Glu 


Gly. 


Val 






755 




Val 


Leu 


Gin Gly 




770 






Leu 


Val 


Gly Ser 


785 








Glu 


Arg 


Phe Tyr 


Glu 


lie 


Lys 


I#s 








820 


Val 


Ser 


Gin 


Glu 






835 




Ala 


Tyr 


Gly Asp 




850 






Ala 


Asn 


lie His 


OUJ 








Gly 


Asp 


Lys Gly 


Ala 


lie 


Ala Arg 








900 


Ala 


Thr 


Ser Ala 






915 




Leu 


Asp 


Lys 


Ala 




930 






Leu 


Ser 


Thr 


He 


945 








Val 


Lys 


Glu 


His 



470 






Leu Asp 


Lys 


Ala 


485 






Ser Thr 


Val 


Arg 


He Val 


Glu 


Gly 






520 


Lys Leu 


Val 


Met 




535 




Arg Ser 


Ser 


Gin 


550 






Arg Val 


Leu 


Lys 


565 






Cys Ala 


He 


lie 


He Ala 


Val 


Phe 






600 


He lie 


Val 


Val 




615 




Arg Asp 


Lys 


Lys 


630 






Asn He 


Arg 


Thr 


645 






Leu Pro 


Tyr 


Arg 


Phe Ser 


He 


Ser 






680 


Phe Gly 


Ala 


Tyr 




695 




Ser Ala 


He 


Val 


710 






Pro Asp 


Tyr 


Ala 


725 






Arg Pro 


He 


Asp 


Phe Val 


Phe 


Asn 






760 


Leu Ser 


Leu 


Glu 




775 




Ser Gly 


Cys 


Gly 


790 






Asp Pro 


Leu 


Ala 


805 






Leu Asn 


Val 


Gin 


Pro He 


Leu 


Phe 






840 


Asn Ser 


Arg 


Val 




855 




Phe He 


Glu 


Thr 


870 






Thr Gin 


Leu 


Ser 


885 






Ala Leu 


lie 


Arg 


Leu Asp 


Thr 


Glu 






920 


Arg Glu 


Gly 


Arg 




935 




Gin Asn 


Ala 


Asp 


950 






Gly Thr 


His 


Gin 



22, 

475 



Arg 


Gly 


Arg 


Thr 




490 






Asn 


Ala 


Asp 


Val 


505 








Ser 


His 


Asp 


Glu 


Gin 


Thr 


Gly 


He 








540 


Asp 


Arg 


Asp 


Asp 






555 




Leu 


Asn 


Thr 


Glu 




570 






Asn 


Glv 


Leu 


Gin 


585 








Asp 


Asp 


Val 


Lys 


Ala 


Gly 


He 


Val 








620 


Leu 


Glu 


Ala 


Gly 






635 




Val 


Val 


Ser 


Leu 




650 






Asn 


Ser 


Val 


Arg 


665 








Gin 


Ala 


Met 


Tyr 


Leu 


Val 


His 


Gly 








700 


Leu 


Gly 


Ala 


Val 






715 




Lys 


Ala 


Lys 


Leu 




730 






Ser 


Tvr 


Ser 


Glu 


745 








Tyr 


Pro 


Thr 


Arg 


Val 


Lys 


Lys 


Gly 








780 


Lys 


Ser 


Thr 


Val 






795 




Gly 


Lys 


Val 


Leu 




810 






Trp 


Leu 


Arcr 


Ala 


825 








Asp 


Cys 


Ser 


He 


Ser 


Asp 


Glu 


He 








860 


Leu 


Pro 


Asp 


Lys 






875 




Gly 


Gly 


Gin 


Lys 




890 






Gin 


Pro 


He 


Leu 


905 








Ser 


Glu 


Lys 


Val 


Thr 


Cys 


He 


Val 








940 


Leu 


He 


Val 


Val 






955 




Gin 


Leu 


Leu 


Ala 



480 



Thr 


He 


Val 


He 






495 




He 


Ala 


Gly 


Phe 




510 




Leu 


Met 


Lvs 


Lvs 


525 








Asn 


Ser 


Glu 


Ala 


Leu 


Glu 


Val 


Pro 








560 


Trp 


Pro 


Tyr 


Val 






575 




Pro 


Phe 


Ala 


He 




590 






Leu 


Leu 


Leu 


Leu 


605 








Glu 


Met 


Lys 


Leu 


Lys 


He 


Ala 


Thr 








640 


Thr 


Glu 


Lys 


Phe 






655 




Lys 


Ala 


His 


lie 




670 






iriic 




A Y r 


Al a 

/vL <X 


685 








Leu 


Met 


Phe 


Val 


Ala 


Leu 


Gly 


Thr 








720 


Ser 


Ala 


Ala 


Leu 






735 




Gly 


Leu 


Pro 


Asp 




750 






Pro 


Asp 


Val 


Pro 


765 








Gin 


Thr 


Leu 


Ala 


Val 


Gin 


Leu 


Leu 








800 


Leu 


Asp 


Gly 


Lys 






815 




His 


Leu 


Gly 


He 




830 






Ala 


Glu 


Asn 


He 


845 








Val 


Ala 


Ala 


Lys 


Tyr 


Thr 


Arg 


Val 








880 


Gin 


Arg 


He 


Ala 






895 




Leu 


Leu 


Asp 


Glu 




910 






Val 


Gin 


Glu 


Ala 


925 








He 


Ala 


His 


Arg 


lie 


Asn 


Gly 


Lys 








960 


Gin 


Lys 


Gly 


He 
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965 970 975 
Tyr Phe Ser Met Val Val Gin Ala Gly Thr 
980 985 

<210> 10 
<211> 14000 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_f eature 
<222> (3429)... (0) 
<223> Translation start codon 

<300> 

<308> Genbank AP000386 
<309> 1999-08-03 

<400> 10 

caattaatta tatcaaaatt tcgtatatca tttaaaattc atcactgatt tttgtttaga 60 

aaaaaagata gatagctatg gacatgacgt cgaattttaa tatatcctat gtaacagtgt 120 

tcatataatc aaaaaagaaa aaataattac tattgtttgt gttctttgca acaatgcgtt 180 

agaaactcgg agaccacttg ttttttctct tatgatttcg cttcagtatg atttagcaaa 240 

gcatctctac ggtacaaaat atcttacgaa ttgacatgat ctgatccaca ctatatcaag 300 

gagttagaaa gacagaactg aaagcttctt agctcgatat gtatttcaag ctacggttca 360 

caccatctag ggagacaaga atggaagaag gcatgacgtc aagtcgaact atgcagctct 420 

cttacttaaa tgggtcgaca aacaaatacg gaaccaacta tcatccattc gagcaagcgg 480 

tgataggtgg tatgactaag gtctctaaat ttggtttacg gcgaaaaatt aattgatcaa 540 

aaaatttttg gtttagtgtt ttaagatctt tttcaaaaaa acgttgtatt tgatgtataa 600 

acgctatttc tttttgaaat taatttaata ttatttttgt ttgaaaagaa gaagaaacat 660 

aattcaaaca atccctatct ttaatgttcc aacatcttaa taaatacata attaaaactc 720 

tacgatcaat actacacttc tgacgattat taaatcgcat cgtcgtacac tattattaca 780 

aagaccgttc tatctacatt aatattcttt atatattttt ttttttcatt ttttgagatt 840 

gttggagtat atatccttag attttcttgt atataaaaat agataattat aatgggataa 900 

acgaaacatc gttgaagccg tgaagtggca ttggtctaca ccagagcaac acaaaaagac 960 

aaccacttca catggtctgc tcttacattc ataaccgaat ctaagtcact tttagttggt 1020 

tgtaattttg taactattat tccaacacct ctttttttac tttttagatt gttttataat 1080 

agaaatattt taattcctaa attaataatg aaagtaaatg taatatgagt cagtacaata 1140 

tgtgaaaaac ttaaaaagtt gacagaattt agcatttgat taaaagtgta tgaagaagaa 1200 

gaaaaaagaa gatctttttg tatctataga tttagtgcat aacttttctc agattttcga 1260 

tatatacaag aatttaacat aagaaaaatc aagacaaatg gacctggtta taatcggtta 1320 

tctgttgtaa atattatatt tcatattctt ctccacttca taattcttat tggagttcct 1380 

tcaccaaatg tttgatgttc cattaaatta atctaccact ctaataagag gtatcgtact 1440 

acaaattaca cttcataaac aagagaagaa cataaatttg aattttttta aaaaaacata 1500 

tgcgttataa caccaaacag taacggacta gctgatcctt gaatttatat tagttgcaaa 1560 

aatttatata tcgaaattga aacatgaatt tttaaaatta ttagaaaatg tatgatgttg 1620 

tctaaatgtg acattacaaa tacatgatgt tgtttaaaaa ttattataaa acaaccaaag 1680 

tttggcgtca ttctggtaaa cgcacttaat gaatttatat acagttaaag attttaacat 1740 

caaattttaa aatgacaact taactaaaat ttgtatccta atatttttac tagagaaact 1800 

cacacatatt tttcaaacaa atgattagtt atatatcctt cgataatagg tattgtgtaa 1860 

aactgtgtgg tttgcacaag tgatccctcg atattctttc tgctaaagat cgacttccca 1920 

cagtttcgat atctcgggtt tgggtgcaat agcatatgct tgtttagtat gcagataatc 1980 

gtatgagaga gtcagagaga tcatctattt ttcatgatat ggtctgagct aatcgatcat 2040 

ttgttattta tactcgataa cgttctaatt tgtatgattt ttcagccttg atctatcaca 2100 

aaatggagat aatgaaatgg tagtcataag aaaggtaatg atccccttga catgcttatt 2160 

ataacacaaa aagtaatcgc tactgattag cttaccctat gatttgaatc atataacttt 2220 

atactaattg tctaagtgtg atgatataat gtatatgatg tcatttataa cttattagta 2280 

aaaatataaa gttcttcatc attgtcataa ggagttaatt ggatatacat caaaaaattt 2340 

cctaaatttt tagaaattat taatcaagtt ataatagaaa ttattaaata aaatatatga 2400 

tgttttctaa atgtgacgat acaaaataca tgatgttgtt caaaacttat tagacagaaa 2460 

ccaaagtaca tcatcattgt gatgaagact ttaatggatt tatacatttc aaagttttaa 2520 

acattactta tttaaatgaa aaattgagct aaaatttgta tacttatatg ttgaccatag 2580 
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aaatacttct atataatttt agacaagttg tctatacata tatatatata tatatatata 2640 

tatatatatg tatttacaca aaaatcaagt tttcttaatg taacaatata aaaatataaa 2700 

atatctaatg ttgtttaaaa attatttaga ccaaaaaaaa aaaaaactgt aatcaaactt 2760 

tcattggatt atatctaact ctaaacatga atttttatac aatgacatgt tcaataaaat 2820 

tcccatttca tagtttgacc agaaaaaatg cttgcagatt tgttaaaatc tctacttatg 2880 

caaaaccatt gaatcacacc agattttgat ttgtagtact ttaaaaatat taattgaaca 2940 

caacataact ttaacaaaaa agatctttat aaaaaaaaaa ctttctaaaa tgacattaat 3000 

taacacaaat tagtcaattc aaacaatcca ataaaatcaa agtaaaccct taattgtttt 3060 

ttttttccat tttacaaaat gttaaacatt aattacatca actagcaaca cgaatctctc 3120 

aatctctcat gtctcaatga gtctttgacc actagtcttc atcactctct ctccctccta 3180 

tatataccaa tcacaccctt ctccaaccac actctcattc tctctctctt ctcaccactt 3240 

caaaacaaga cactcgtaaa agagttcttg aacttcacaa aacaattgtc agattttcaa 3300 

gaaaaacttt ataaaacaaa aaacatttca ttctttctct ctctctctct cactgctcaa 3360 

tgatctcgtt ttctcactaa accaactcgt ttcttcttac tttctttaac tcggatctac 3420 

aaaaaaccat gtcggaaact aacacaaccg atgccaagac tgttccagca gaagcagaga 3480 

agaagaaaga acagagttta ccattcttta aactcttttc tttcgctgat aaatttgatt 3540 

atctcttaat gttcgttggt tctcttggtg ccattgttca tggctcttcc atgcctgtct 3600 

tctttttact ctttggtcaa atggttaatg gatttggtaa aaaccaaatg gatttacatc 3660 

aaatggttca tgaagtctct agagtaagtc ttttttttct ttcttgtttt tatcaaatta 3720 

gatccaagat ctgatctaat ttttgtgttt gtgaaatttt gcagtattct ctatatttcg 3780 

tctacttggg tttggtcgtt tgcttctctt cttacgcagg tttgtttctt acaaatccaa 3840 

tcttcttttt ctttattctt caatcaagaa acccagtaat taaatcacat tataaacaag 3900 

atcaagaaat tttttgcaca aaaaaaaaag aaaacaagat caagaaatta ataaagttta 3960 

gattaaacat ttgcattatt ttcttttttt tggttcaacc aattaaccca ttaacataaa 4020 

ttaaatcaga ttattaggca agaaatctaa aggtgtattt tttgagtttt atttgattct 4080 

tcctcaaagt gaccatttac tctgaaggta aaacattttt ttttactcta caacaaaaac 4140 

ccaacttctt tttttgtttg cttcaaaatt atattaaaaa aagaaagctt aaactttgtt 4200 

taaaatcttg ttttttttgt cattcgcttt tgattagaac taaaaaaacc atttttatag 4260 

aatgttgttt acataagtga tgtaatgggg ttcggaacga actttccgtg ccaaccgctt 4320 

ttatagacaa gaaatctaaa ggtgtaattt ttgagttttt atttgattct tccacaaagt 4380 

gatcatttac tctggtaaac atttttttac tctacaacaa aaacccaact tttttatgtt 4440 

tgcttcaata aataaaaacc tagaaacaga aatcacaatc tagagagaat caaaattata 4500 

tataaaaaaa aaacacttca aatttttaaa aattttaata tgttaaaagg ataagccaag 4560 

tccacgtgat tcatggacta ctactttgtc tatatcgaaa aaaaaaaaaa aatgggcatc 4 620 

tctctcacat ttattacaac ttcataaaaa ttcttgtaat aataaccata attttttgtt 4680 

aaataatttt acagagatag catgttggat gtattctgga gaaagacaag tagcagcatt 4740 

aaggaagaaa tatcttgaag cagtattaaa acaagacgtt gggttctttg atactgatgc 4800 

tagaactggt gacattgtct ttagtgtttc tactgatact cttcttgttc aagatgccat 4860 

tagtgaaaag gtaaccattt ttttttttac taccaacaaa atctagtatt taccggcggg 4920 

tcccattttt ttcatgatca atctttatca aagttttggg tccgtttatg ttagcaaccg 4980 

tcgtttatct tgcgcctcag gtggttaggg aagaaaaaag aetatagatt gatttttttt 504,0 

tttgttaaaa taaaagttgg atttttttca taagattgtg aaaataagta attttttttt 5100 

acaaatttgc tatagaaata gaggattttt ttcactattt aactaaaact taaatgatat 5160 

agtactcatt ttcttacatt tttaatataa aaatagcttc tttttttcca ttgtgaaaaa 5220 

tatttattat gcaaaataat gtttctttaa caagtaaaat aaatatttgt tattaaaaaa 5280 

aatgtgtaaa gtaccttcaa attttaatag tacattttat ctaaaactgt tttttttttt 5340 

gcaatttggt cctgtctctg tggtgcacgt tagcaagcac gagctaacat gcactgggtc 5400 

ccctagtttc attattatca tcactctttt ttattgttga tgaagtgaat taataacaga 5460 

gtagttgaaa aaaaattgaa tttctcaggt tggaaacttt atacattacc tctcaacatt 5520 

tttggcggga ttagtagttg gatttgtatc agcatggaaa ttagctttgt taagtgttgc 5580 

tgtgattccc ggaatcgctt tcgccggagg tttatacgct tatacactca ccggaattac 5640 

ttcaaagagc cgtgaatctt atgctaacgc cggtgttatc gccgagcagg taaataaaga 5700 

ttaaagtttg ttcctttatt tgttttattt aatgcccacc aactgttcga cgtaatgctt 5760 

tttaactgtt cttaggcttg cttctttgta cttgatctct caaatgattt tttttttttt 5820 

tcattgcttc ttatgcaatc caaagactta aacagtgttt tcactaattc aaaatctttg 5880 

cttttctttg gtcggttaat gattgatgtt ttggtttctc aggcaattgc tcaagttcga 5940 

actgtttatt cttatgttgg agagagtaag gcacttaatg cgtattcgga tgcgattcag 6000 

tatacgctta agctcggtta taaagcgggg atggctaaag ggttgggttt aggatgtact 6060 

tatggaatag cttgtatgtc atgggctttg gtgttttggt atgctggagt ttttattcgg 6120 

aatggacaaa ccgatggagg aaaggcgttt actgctatat tctctgctat tgttggtgga 6180 

atgtaagaat tcaaaacata acaatgatgt ggttttgtgt attttgtgtt ttgttattac 6240 

ttggatgtca ctttttgttt ctgtgtgtgt ttttaggagt ttggggcaat ctttctcgaa 6300 
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tcttggggcg tttagtaaag gtaaagcggc tggttataag ttgatggaga taattaacca 6360 

gagaccgacg ataattcaag acccgttgga tggaaaatgt . ttggatcaag ttcatgggaa 6420 

cattgagttt aaagatgtga cttttagcta tccttcacgg cctgatgtta tgatcttcag 6480 

gaactttaat attttcttcc cttctgggaa aactgtggcg gttgttggtg ggagtggctc 6540 

tggaaagagt actgttgttt ccctcattga gagattctat gatccaaaca gcggtaattt 6600 

gattgaattt gttttttgct atttgagtct ttggctagtg actggatcat aactttgttt 6660 

attttttctt gatgagcagg gcaaattctg ttggatggtg ttgagataaa gacgcttcag 6720 

ttgaagtttt tgcgtgaaca aatcgggctt gtgaatcaag aacctgcgct ctttgccact 6780 

actatactag agaacatact ctatggaaag cctgatgcaa caatggttga agttgaagct 6840 

gctgcttccg ctgcgaatgc gcatagtttc attacattac ttcctaaagg ctacgacaca 6900 

caggtataaa tcaaatattt gattatatgt ctagttaatg tcatggcctt ttgcttaatt 6960 

ttttgttgaa tgtcaataca attaggttgg agaacgtggt gttcaactct caggtggaca 7020 

gaagcagaga attgcaattg ctagggcgat gttgaaagac ccaaagattc tgttactaga 7080 

tgaagctaca agcgctcttg atgctagctc tgagagcatt gttcaggaag ctttagacag 7140 

agtcatggtg gggaggacca ctg-ttgttgt tgctcatcgt ctctgcacca tcagaaatgt 7200 

tgattccatt gccgtgatac agcaaggcca agttgttgaa accggaacac atgaagaact 7260 

cattgccaaa tccggtgctt acgcatccct catcaggttt caggaaatgg ttggtactcg 7320 

agatttctca aacccgtcaa ctcgtcgcac tcgttcaacc cgtttgagcc attcactgtc 7380 

aacgaaatca ctcagtttaa gatcaggaag tttgaggaat ctgagctatt cttacagcac 7440 

tggagctgat ggtcggatag agatgatttc aaatgcagag actgaccgaa agactcgtgc 7500 

ccctgaaaat tacttctaca ggcttctcaa gcttaattca ccggaatggc cttactcaat 7560 

catgggagca gtaggctcaa ttctttctgg tttcattggt cctacatttg ctattgtgat 7620 

gagcaacatg atcgaagtct tctactacac agactatgat tcaatggaaa ggaaaacaaa 7680 

agagtatgtc ttcatctaca ttggtgctgg tctctatgct gtgggtgctt atttgatcca 7740 

acattacttc tttagcatca tgggagaaaa cctcacaaca agagtaagaa gaatgatgct 7800 

ctcaggtatg tatcaaaatc tcctgaattt gcttaaaatc actttcccat ttcttatttt 7860 

ggtttcttga ttgttcttat ttagctatct tgagaaacga agttggttgg ttcgatgagg 7920 

atgaacacaa ctcaagcctg atcgctgcac gtttagctac tgatgcagca gatgttaaat 7980 

ccgctatagc cgagagaatc tcagtaattc tacaaaacat gacttcactt ctcacatcct 8040 

tcatagtcgc cttcatagta gaatggagag tctcacttct catcttaggc acattcccac 8100 

ttctagtcct cgctaacttt gctcaggtaa ataatctaat ctttttactc aaaatctttc 8160 

aatattcatc aatcattaaa atataattgg aatcttgcat tcaccattat gatcttaaga 8220 

aaaaacgaca aaaggccaga tttttataaa tttatatttg cttttcaaaa gttcaaaact 8280 

ttatagaaca tggcaacaca gccactgcct ctacacgtgc ttcatcttct aactttatcc 8340 

aagtttgcat ttatgtttac aataataatc ataaaagaat tattaagaag cttttttttt 8400 

ctactttttg gaaatagtgt aagggtcaag atcatggagc cctcacatca ataaatgtgc 8460 

taaaaaaatt aaaaaacagt aggctttagt ttactctctg ggcatgtgtg aagaatattt 8520 

attatatagt ttctattggt actatgaccc atagataaca gtgttcacga aaatagctaa 8580 

gattcctctg tcttttgctt ctgctaaatc tatcctactt taagctttca tatttacttc 8640 

actctctgaa ctctgaactg tgatcccact tctctcttta tttaattctt ttgcccataa 8700 

aacctcacca caaaaatcca aaaaatctgc aatttttttc cttcttagaa ccaatatttt 8760 

atttagagtt cttcattggt caaagttgtt gtctcagtgc attatttact tatcacagtg 8820 

tgtgtgtcag tgtttttaca ccatccacta gtcaatgttt gcttgtgggt ttctttgttt 8880 

tggtagatta ggttgtgatg agtttttttt ttgtttctaa ctagctgcaa ggttcaggac 8940 

tctgctttga tatatcacca acattttttc acccgtgatc taattattag ttgaaaaatc 9000 

tatcaaatag atttcacaca gaagaacata gataggcatt gtacttgtac tgatgttgat 9060 

gggatagagt gttgcatatg tgatttaact atatggttct acgtcatgtt ttagtggcga 9120 

cttagacctt tgattgtcaa tcttattttt tacaagtgaa ctattattac catctgttgt 9180 

tctaaatcat aagtattaat atatgtggct acattgcagc aactatctct gaagggtttt 9240 

gctggagaca cagctaaggc tcatgcaaag. acttcaatga ttgctggtga aggagtcagt 9300 

aacattagaa ccgtagcagc tttcaatgca cagagcaaga ttctctcttt gttctgtcat 9360 

gagcttcgtg tacctcagaa aagaagctta taccgaagtc aaacctcggg tttcctattt 9420 

ggcctctcgc agcttgctct ctatggttct gaggctttaa ttctctggta tggtgcccac 9480 

cttgtgagta aaggcgtgtc aaccttttcc aaagtgatca aagtgtttgt ggttttggtc 9540 

attactgcaa actctgttgc tgaaactgtc agtcttgctc ctgaaattat tcggggaggt 9600 

gaagctgttg gttcggtttt ctcggtcttg gacaggcaga ccaggattga cccggatgat 9660 

gctgatgctg atcccgtgga gacgatccgt ggagacattg agtttaggca tgttgatttc 9720 

gcttaccctt caagacccga cgtcatggtt ttcagggact ttaacctcag aattcgagct 9780 

ggacatagcc aagctcttgt gggcgcgagt gggtcaggga agagttctgt aattgcgatg 9840 

atcgagcggt tttacgaccc tcttgctgga aaagtcatga ttgatggcaa agacatccgc 9900 

cggctaaacc tgaaatctct aaggctcaaa atcggtcttg ttcaacaaga accagctctc 9960 

ttcgcagcaa cgatcttcga caacatcgcc tatggtaaag atggtgcaac tgaatccgag 10020 
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gtaattgatg cagctcgagc cgcaaatgct cacggtttca tcagtggttt acctgaaggt 10080 

tacaaaactc cagtaggcga aagaggagtg cagttatcag gtggacagaa acagaggatc 10140 

gcgatagcaa gagctgtgct caagaaccct acagtgttgc ttctagacga agcaactagc 10200 

gcactagatg cagaatcaga atgcgtgctg caagaggcgt tagagaggct catgagaggt 10260 

cggaccaccg tggtagttgc tcaccgcttg tccaccataa gaggtgttga ttgcattggt 10320 

gtgattcaag acgggcggat tgtggagcaa ggcagccatt cagagctcgt tagccgacca 10380 

gagggagctt attcaaggct gttacagctt caaacacata ggatttgaag cttgatcatg 10440 

gattaaaaac aaaaaatcgg tttgtgtaat tttttttata ttaaaacttt aatttggaag 10500 

atttctatgg actataacga taatatgaat aggtgtagat aatgaagctt ttggagtgtt 10560 

tatgaaggtt ctttaattaa gggttatttt tttcgcattt tgcttatgtg cccgttttgg 10620 

aacataagac tgaactatgt tttcgttatg tttttaattt atgcctcgaa acaaaacaaa 10680 

atcactctaa tactttggtt gatcaaaatt tccctaaaac atttggttca tgactagaat 10740 

tgatcggaca tgttctttta ggggtttgat ctcgcatggt ctaatcaggt tatgtggtct 10800 

tacagcgact taatcaattt agaccggcta tcaaaattag attcagccta aacagcaaaa 10860 

ttgtcttata taatatccta aatcacagac cggatttttt ttttttataa actctatcgg 10920 

ctgatccggt caaccccggt agaaacgcac tgaagtgttt catagtggac ttcaaaccga 10980 

aattttgcag aaattgagat gagaatgagc tctacctcgc attgacatgt tatttcgttc 11040 

ttgagatctt aagtaatttt ttgagatctt aagtaatttt ttggtattca agttccttat 11100 

ttttttgcgg ttactcgtct tgacatgtta tttcgttctt gtgctactta cataccacaa 11160 

tatatgtagg tttctaaaca tatataatag gaatgttgtt ctatatatgt gttgtttgat 11220 

acgtaaagag taatctggtt ctacgtagag tctatctctt gtttcttggt gagattttga 11280 

ctggactgca tggtaatcct cgcgcatggt gtttagttac cggttgcttc attcatttgg 11340 

ccacgctcaa attttctaga agactcttcg gtctttggta caaaggcctc cgatttttcc 11400 

atgacaagtt cattcatatt tttttgtgac aacttgattg ttgttcattt ccgaatattc 11460 

catgaacgta tcaaactcaa taaaatcgat tgttgtcact ccctttaaga tgagatcctc 11520 

aacggcagct tcaccactag ccatgttttc acacatgatc aactatctca catttttact 11580 

aaggctttaa gcaggaatat tacttctaga attttaacaa tctagaattt ctagaggaaa 11640 

tgttcacatg tatatatttt ggtatatgat ctagagaact tatataagat tatcaatgtt 11700 

taatatactc aacaaatggg ttcttctatt ttatcttcta attgttgggg ccttgaatta 11760 

tttagttctt tttcactatt ccttttcttt tatttggtgt gtggagccat aaagttccaa 11820 

taatttctat gaaaagaaat gaataaaatc taaagaaacc aactgtgaag ctctttaatc 11880 

ttattctatc tgttggatct gtttgccttt atttgctttg catgtggttt tgctgtttaa 11940 

gataaaggcc atatcttctc ctaattcttt ttccaaatca acacatcaaa tatataaaag 12000 

actccaaata ttttgaatag aatcataatt tatttaatta caaaaaaact taaagaattc 12060 

aagacctgcg atcacaagac aaatacatgt aaggaattaa atatttaaga taccaaccgc 12120 

taataaataa gactcacaat ttctaaaaat taaaacgaaa aaaagaaaaa attagtcatc 12180 

attgcatatc ttagtgaaca aaaatcaata agaaaaggat aatatacttt ttaccaacat 12240 

cacataaacc taaaagcatc tcacaccaaa agtaaattaa aaactcagaa aaagctgaat 12300 

acaagttatg aattttatgg aaagtgttag tgtgttagtg ttgttatcat tacaacttta 12360 

aaataattat gaacatgaaa aaatagattg aacatgaagc agcattggga aggactcgaa 12420 

tgtccctact catactgaca aaacctcaag tgtrtccagtg cgttaggttt cactttccag 12480 

tagtatgtgt atctgctttt tttgtatata ttaatgtaat agatagtgta gagaatcatg 12540 

aggaacattt tgtaatcata gaactaaggc ataattttga tcgtgtgaaa taatactatt 12600 

atttttggta acaaggatga tctggaccga attccgacta attttccagg tgaatatagt 12660 

catcaagtaa cttgatcacc ggattttaaa aatgaattat gaacatgata tcatttcaac 12720 

taatcaaata aaaatgaaaa aattcatgtg ggaatcatat ccctcaaatc tgttaccacc 12780 

aaactacttt catttggttt gtgtgagaaa tataagaatt atcaaacaaa gaaaaaggtt 12840 

ttgcttaatt ataatgatat gaggttcatc aaatttatat accactaaaa tattcctaca 12900 

ctcatccaaa tattattctc gcttatgcag ttttaatcta atgtacaaat catatccata 12960 

agcgcattgg ccgatgctaa aattttggca ttagttgttg atttgttttt cttgagtcct 13020 

atgttcaaaa aatatatttg ataatttaat gtataaatta tacccataat cacattgatc 13080 

aatgcttaaa gttttgtatt agttgttgat tttttgtttg tttgcgtgtg ttgaattttc 13140 

ataaatgaaa agtaaatacg ataatcatat ttcaaaacgt aaaggtgtta attagtagtc 13200 

taaatggtaa aatataaaca gtatattata aaaattacta aaatggtttt gtaaaaaaaa 13260 

tatgctattt gtattataat gaaattcaaa aattttaaat agaacgtata atttctgcac 13320 

aaagaggttt tgaggtgtta taattcatga agtaaatttt atttactgac gggtaagttg 13380 

tgaaaaagtt tgaagactat tttttttgct tttcacagag aaaactactt tcctttttaa 13440 

ttgtatgatg agaaggcaaa agtgcagaca tgtgctttct tttctcccat tttcaacaat 13500 

gtcactcgtt gtattattca tattttagca aactggttat atctatatct atcaatcatt 13560 

tcagaacatc atatccatca gtttttggac attgctacat acgttagtat tgatgtacca 13620 

gttaccctaa caggcttttg catagtgtgg cagaacacgt gaggtgtgat atatgcggat 13680 

gaattctatg ttctgcattt tgttaccatt catataaaag tattgtttta gttgtgctgc 13740 
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ggtttaagtc ctaatacaga tttatttgta gtgttatgtt tataaaccac attggaaccc 13800 

aaactctaag agaatataag atcaatgtgt aattaataaa ttttatagtc attcggataa 13860 

aacttagaaa gataacaaaa gtaagaacga gtatttttaa gcgaatactc tttagatatt 13920 

ctttgtattg atcattattg acaaaccatt aaacattttt gtagacccta aaatccgata . 13980 

tccaaaaaac aattcttatt 14000 



<210> 11 
<211> 29 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 11 

cactgctcaa tgatctcgtt ttctcacta 29 

<210> 12 
<211> 30 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 12 

cttgaatcac accaatgcaa tcaacacctc 30 

<210> 13 
<211> 29 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 13 

cattttataa taacgctgcg gacatctac 29 

<210> 14 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 14 

tttctccata ttgaccatca tactcattg. 29 



BNSOOCID: <WO 0O18888A1 I > 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCTAJS99/2236S— 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC(6) : Please See Extra Sheet 

US CL :Please See Extra Sheet 
According to International Patent Classification (IPC) or to both national classification and IPC 

R FIELDS SEARCHED 

Minimum docomentation searched (classification system followed by classification symbols) 

US. : 800/27*. 294. 300; 435*9.1. 7L2, 4*8, 419. 2523; 320.1; 536723.6. 24.1 

Documentation searckod other than minimum documentation to the extent that such documents arc included in the fields searche d 



Electronic data base consulted during the international search (name of data base and. where practicable, search terms used) 
Please See Extra Sheet 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 


Citation of document with indication, where appropriate, of the relevant passages 


Relevant to clsim Ho. 


X 


DUDLER ET AL. Structure of an mdr-like Gene from Aratridopsis 
thaliana. The Journal of Biological Chemistry. March 1992, Vol. 
267, No. 9, pages 5882-5888, see pages 5883, 5885, and 5888. 


24, 29-30 


Y 


1-6 


Y 


CHO et al. An Anion Channel in Aratridopsis Hypocotyk Activited 
by Blue light. Proc. Natl. Acad. Sci. USA. July 1996, Vol. 93, 
pages 8134-8138, see page 8134. 


1-2 


X 


EM YR DA VIES et al. Cloning and Characterization of a Novel P~ 
Glycoprotein Homologue from Barley . Gene. June 1997, Vol. 199, 
pages 195-202, see whole document. 


24,29-30 


Y 


1-6 


fx) Furtl 


>er documents are listed in die continuation of Box C. f^jj See patent family annex. 





Spwal MMforiH of cited 4 



to bo of pwtMulv ivWmim 



•X* 




otod to wtabtafa tfto po b ti o tiOD dsto of 
son (o ^MOsfisd) 

Twftt i ui£ to so oral dnolassso, on, w hib a io o or odfctr 



t of f w rtkuhr nhmot; cfanziod k*»« 
OOMMtolod to mnokf ■> mvaotiv* step mhmi (ft 
oaoib&Md with oso or moro othor such doewws, r 
boon obvious to m paraoo akjDed in d» art 



of Ami 




ttaiafsmOr 

international search report 

27 J^'2000 



Date of the actual completion of the mtomatxmal search 
23 DECEMBER 1999 



/ 



Name and mi 



address of me ISA/US 



Box PCT 

Wastnngton, ttC 30231 
Facsimile No. (703)305-3230 




(703) 308-0196 



Form PCT/lSA/210 (second sbeetX»ry 1992)* 

BNSOOCta <WO 0Ot8886Al_l_> 



INTERNATIONAL SEARCH REPORT 



International a pplicati on No. 
PCT/US99/22363 



C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT 



Categnty* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



SIDLER et al. Involvement of an ABC Transporter in a 
Developmental Pathway Regulating Hypocotyl Cell Elongation in 
me light The Plant Cell. October 1998, Vol. 10, pagesH 623- 1636; 
see pages 1623 and 1629-1634. 

TOMMASINI et al. Differential Expression of Genes Coding for 
ABC Transporters after Treatment of Arabidopsis thaliana with 
Xenobiotics. FEBS Letters. May 1997, Vol. 411, pages 206-210, 
see page 206. 



US 5,786, 162 A ( CORBISIER et al) 28 July 1998, see whole 
document 

US 5,073,677 A (HELMER et al) 17-December 1991, see whole 
document 



24,28-31 



1-6, 9-23 



1-6,24 



1-6, 9-24, 28-31 
1-6, 9-24, 28-31 



Fonn PCT/lSA/210 (cootmulkm of second iboetXJaly 1992)* 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US99/^3oT 



Bos I OfciemdoM where certain daima were found smseerckabts (OmtinuatJoa of item 1 off first sheet) 
This nteraationeJ report has not been established a respect of certain claims under Article I7(2X») for the fioOowmg reasons: 

1. I I Claims Noaj 

^ became they relate to subject matter not required to be searched by this Authority, namely: 

2. r~| Claims Nosj 

— because tbey relate to parts of the international application that do not comply with the pfcscribcd requirements to such 
an extent mat no meaningful mternatxmal search can be carried out, specifically: 

* 

3. Q Claims Nos^ 

because they are dependent chums and are not drafted in a ccor dan ce with the second and third s e n tences of Rule &4(a> 

Box II Observations where unity of in Ten don Is lacking (Continuation of item 2 of first sheet) 
This International Searching Authority found multiple inventions in mis international application, as follows: 
Please See Extra Sheet 



1. Q As ail required additional search fees were timery paid by the applicant. thU international search report covers all searchable 

claims. 

2. Q As ail searchable claims could be searched without effort justifying an additional fee, mis Authority did not invite payment 

of any additional fee. 

3. | | As only some of the required additional search fees were timery paid by the applicant, this international search report covers 

only those claims for which fees were paid, specifically claims Nosj 



4 * Qa No rwwircd •rfWitwo*! *carch fees were timery paid by the applicant Consequently, this mternational search report b 
restricted to the invention first mentioned in the claims; it is covered by claims No*.: 
1-6. 9-24, 2W1 



Remark en Pretest | | The additional search fees were accompanied by the applicant's protest 

I I No protest accompanied the payment of additional search fees. 



Form PCT/1SA/210 (continuation of first sheet(i)XJuly 1992)* 

BNSOOCta <WO___0018886A1_I_> 



INTERNATIONAL SEARCH REPORT 



Inte rna tional application No. 
PCT/US99/22363^ 



A, CLASSIFICATION OF SUBJECT MATTER: 
IPC (6): 

C12N SAM. 15/00. 15/09, 15/11, 15/29, 15/63, 15/74. 15/81. 15/82 ; A01H 5/00 

A. CLASSIFICATION OF SUBJECT MATTER: 
USCL : 

800/278. 294. 300; 435/69.1. 71.2. 468. 419. 2523, 320.1; 536723.6. 24.1 

B. FIELDS SEARCHED 

Electronic data bases consulted (Name of data base aad where practicable terms used): 
DIALOG. WESTI_2a 

SEARCH TERMS: MDR-UKE GENES. P-GLYCOPROTEIN GENES. ARABIDOPS1S. NPPB. XENOBIOTIC. 
RESISTANT PLANTS. ABC TRANSPORTER, AtPGPl EXPRESSION. TRANSGENIC PLANT 

BOX 1L OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKINO 
This ISA found multiple invention a as follows: 

This application contains the following inventions or groups of inventions which are not so linked as to form a single 
inventive concept under PCT Rule 13.1. In order for all 'inventions to be searched, the appropriate additional search foes 
must be paid. 

Group I. claim(s)l-6, 9-24. 28-31. drawn to an isolated nucleic acid in a recombinant expression cassette, a vector 

comprising it, a transgenic plant, and a method for producing a plant with enhanced resistance to xenobiotic compounds. 

Oroup n. cUim(s) 7-8. 25-26, 32-38. drawn to an isolated protein and antibodies for the protein. 

Group HI. claim(s) 27, drawn to an oligonucleotide. 

Oroup IV. daim(s) 39-40, drawn to P-grycoprotem gene promoter. 

Group V. claim(s) 41-45, drawn to a plant with nutated pIPAC gene and a method of making it 

The inventions listed *s Oroups I-V do not relate to a single inventive concept under PCT Rule 13.1 because, under PCT 
Rale 13.2, they lack the same or corresponding special technical features for the following reasons: 

The claimed isolated nucleic acid molecules aad transformed ecus are anticipated by each of Dudkr et al, Emyr Davis et 
at, aad Staler et al. as set forth in the Search Report, and so do not constitute a single special technical feature which 
would be an advance over the prior art 

The mvention of Group I, drawn to a first product and process of use, requires an isolated nucleic arid encoding P- 
grycopfotein. a vector, host cells, and a method for plant transformation and regeneration not required by any other 
group. 

The invention of Group II, drawn to a second product, requires an isolated polypeptide and antibodies for the 
polypeptide not required by any other group. 

The invention of Group HI, draws to a third product requires an oligonucleotide and a hybridization technique not 
required by any other group. 

The mvention of Group IV. drawn to a fourth product requires a specific gene promoter not required by any other 
group. 

The mvention of Group V, drawn to a fifth product and method of use, requires a plant with mutated pfPAC gene and a 
method of making it not required by any other group. 



Form PCTASA/210 (extra sheetXJury 1992)* 

BNSOOCJDt <WO 0018886A1J. > 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 



I2J LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 



IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




